Connecting Phrasal and Rhythmic Events: Evidence from Second Language Speech
ERIC Educational Resources Information Center
Nava, Emily Anne
2010-01-01
This dissertation investigates the relation between prosodic events at the phrasal level and component events at the rhythmic level. The overarching hypothesis is that the interaction among component rhythmic events gives rise to prosodic patterns at the phrasal level, while at the same time being constrained by the latter, and that in the case of…
What Do You Mean by That?! An Electrophysiological Study of Emotional and Attitudinal Prosody.
Wickens, Steven; Perry, Conrad
2015-01-01
The use of prosody during verbal communication is pervasive in everyday language and whilst there is a wealth of research examining the prosodic processing of emotional information, much less is known about the prosodic processing of attitudinal information. The current study investigated the online neural processes underlying the prosodic processing of non-verbal emotional and attitudinal components of speech via the analysis of event-related brain potentials related to the processing of anger and sarcasm. To examine these, sentences with prosodic expectancy violations created by cross-splicing a prosodically neutral head ('he has') and a prosodically neutral, angry, or sarcastic ending (e.g., 'a serious face') were used. Task demands were also manipulated, with participants in one experiment performing prosodic classification and participants in another performing probe-verification. Overall, whilst minor differences were found across the tasks, the results suggest that angry and sarcastic prosodic expectancy violations follow a similar processing time-course underpinned by similar neural resources.
Männel, Claudia; Schaadt, Gesa; Illner, Franziska K; van der Meer, Elke; Friederici, Angela D
2017-02-01
Intact phonological processing is crucial for successful literacy acquisition. While individuals with difficulties in reading and spelling (i.e., developmental dyslexia) are known to experience deficient phoneme discrimination (i.e., segmental phonology), findings concerning their prosodic processing (i.e., suprasegmental phonology) are controversial. Because there are no behavior-independent studies on the underlying neural correlates of prosodic processing in dyslexia, these controversial findings might be explained by different task demands. To provide an objective behavior-independent picture of segmental and suprasegmental phonological processing in impaired literacy acquisition, we investigated event-related brain potentials during passive listening in typically and poor-spelling German school children. For segmental phonology, we analyzed the Mismatch Negativity (MMN) during vowel length discrimination, capturing automatic auditory deviancy detection in repetitive contexts. For suprasegmental phonology, we analyzed the Closure Positive Shift (CPS) that automatically occurs in response to prosodic boundaries. Our results revealed spelling group differences for the MMN, but not for the CPS, indicating deficient segmental, but intact suprasegmental phonological processing in poor spellers. The present findings point towards a differential role of segmental and suprasegmental phonology in literacy disorders and call for interventions that invigorate impaired literacy by utilizing intact prosody in addition to training deficient phonemic awareness. Copyright © 2016 The Authors. Published by Elsevier Ltd. All rights reserved.
When emotional prosody and semantics dance cheek to cheek: ERP evidence.
Kotz, Sonja A; Paulmann, Silke
2007-06-02
To communicate emotionally entails that a listener understands a verbal message but also the emotional prosody going along with it. So far the time course and interaction of these emotional 'channels' is still poorly understood. The current set of event-related brain potential (ERP) experiments investigated both the interactive time course of emotional prosody with semantics and of emotional prosody independent of emotional semantics using a cross-splicing method. In a probe verification task (Experiment 1) prosodic expectancy violations elicited a positivity, while a combined prosodic-semantic expectancy violation elicited a negativity. Comparable ERP results were obtained in an emotional prosodic categorization task (Experiment 2). The present data support different ERP responses with distinct time courses and topographies elicited as a function of prosodic expectancy and combined prosodic-semantic expectancy during emotional prosodic processing and combined emotional prosody/emotional semantic processing. These differences suggest that the interaction of more than one emotional channel facilitates subtle transitions in an emotional sentence context.
Criteria for Labelling Prosodic Aspects of English Speech.
ERIC Educational Resources Information Center
Bagshaw, Paul C.; Williams, Briony J.
A study reports a set of labelling criteria which have been developed to label prosodic events in clear, continuous speech, and proposes a scheme whereby this information can be transcribed in a machine-readable format. Prosody was annotated in a syllabic domain synchronized with a phonemic segmentation. A procedural definition of…
ERIC Educational Resources Information Center
Vion, Monique; Colas, Annie
2009-01-01
This study deals with the determinants of prosodic phrasing in French schoolchildren's narratives. Children (aged 7 to 11) told picture stories to a silent same-age peer. The establishment of temporal and/or causal relations between the events was more or less guided by the drawings (ordered vs. arbitrary sequences). The comprehension of the…
White matter pathways for prosodic structure building: A case study.
Sammler, Daniela; Cunitz, Katrin; Gierhan, Sarah M E; Anwander, Alfred; Adermann, Jens; Meixensberger, Jürgen; Friederici, Angela D
2018-05-11
The relevance of left dorsal and ventral fiber pathways for syntactic and semantic comprehension is well established, while pathways for prosody are little explored. The present study examined linguistic prosodic structure building in a patient whose right arcuate/superior longitudinal fascicles and posterior corpus callosum were transiently compromised by a vasogenic peritumoral edema. Compared to ten matched healthy controls, the patient's ability to detect irregular prosodic structure significantly improved between pre- and post-surgical assessment. This recovery was accompanied by an increase in average fractional anisotropy (FA) in right dorsal and posterior transcallosal fiber tracts. Neither general cognitive abilities nor (non-prosodic) syntactic comprehension nor FA in right ventral and left dorsal fiber tracts showed a similar pre-post increase. Together, these findings suggest a contribution of right dorsal and inter-hemispheric pathways to prosody perception, including the right-dorsal tracking and structuring of prosodic pitch contours that is transcallosally informed by concurrent syntactic information. Copyright © 2018 Elsevier Inc. All rights reserved.
Sridhar, Vivek Kumar Rangarajan; Bangalore, Srinivas; Narayanan, Shrikanth S.
2009-01-01
In this paper, we describe a maximum entropy-based automatic prosody labeling framework that exploits both language and speech information. We apply the proposed framework to both prominence and phrase structure detection within the Tones and Break Indices (ToBI) annotation scheme. Our framework utilizes novel syntactic features in the form of supertags and a quantized acoustic–prosodic feature representation that is similar to linear parameterizations of the prosodic contour. The proposed model is trained discriminatively and is robust in the selection of appropriate features for the task of prosody detection. The proposed maximum entropy acoustic–syntactic model achieves pitch accent and boundary tone detection accuracies of 86.0% and 93.1% on the Boston University Radio News corpus, and 79.8% and 90.3% on the Boston Directions corpus. The phrase structure detection through prosodic break index labeling provides accuracies of 84% and 87% on the two corpora, respectively. The reported results are significantly better than previously reported results and demonstrate the strength of the maximum entropy model in jointly modeling simple lexical, syntactic, and acoustic features for automatic prosody labeling. PMID: 19603083
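As a reading aid for the abstract above, the following is a minimal sketch of how a conditional maximum entropy model turns weighted features into label probabilities, p(y|x) ∝ exp(Σ λ·f(x, y)). The feature names, labels, and weights are hypothetical placeholders chosen for illustration; they are not taken from the paper, and no training procedure is shown.

```python
import math

def maxent_probs(features, weights, labels):
    """Conditional maxent distribution over labels for one token.

    features: dict mapping feature name -> value (hypothetical names)
    weights:  dict mapping (feature name, label) -> weight (lambda)
    """
    scores = {
        y: sum(weights.get((f, y), 0.0) * v for f, v in features.items())
        for y in labels
    }
    # Subtract the maximum score before exponentiating for numerical stability.
    top = max(scores.values())
    exps = {y: math.exp(s - top) for y, s in scores.items()}
    z = sum(exps.values())  # normalizer Z(x)
    return {y: e / z for y, e in exps.items()}

# Hypothetical token with one syntactic and one quantized acoustic feature.
features = {"supertag=NP_head": 1.0, "f0_bin=high": 1.0}
weights = {
    ("supertag=NP_head", "accent"): 1.2,
    ("f0_bin=high", "accent"): 0.8,
}
probs = maxent_probs(features, weights, ["accent", "none"])
```

With these made-up weights the "accent" label scores 2.0 and "none" scores 0.0, so the model favors "accent"; in a real system the weights would be estimated from labeled data.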
Männel, Claudia; Friederici, Angela D
2016-02-01
Children's perception of prosodic phrasing provides a head start into the discovery of speech structure. Based on the close prosody-syntax correspondence, children can infer the underlying syntactic structure from the acoustic modulations of prosodic boundaries, typically consisting of co-occurring pitch changes, preboundary lengthening, and pausing. Previous electrophysiological studies revealed that listeners are to some degree flexible in the detection of major prosodic boundaries that are not marked with all three of the suprasegmental cues. Adults and 6-year-olds still showed the brain response for prosodic boundary perception, the Closure Positive Shift (CPS), when pauses marking boundaries were deleted. In contrast, younger children at 3 years did not show this ability yet, but required pausing to complement the other boundary cues. Following the hypothesis that German weights duration cues more heavily than pitch cues, we here examined 3-year-olds' brain responses to prosodic phrasing, testing the role of boundary-related pitch changes. Results revealed that children at this age even showed the CPS in response to pitch-neutralized boundaries with only pausing and preboundary lengthening being present. These results indicate differential roles of acoustic cues in boundary perception, with a preferential reliance on duration cues over pitch changes in 3-year-olds. This preference likely results from the characteristics of the German intonation system and furthers the discussion of cross-linguistic differences in the weighting of prosodic boundary cues. Copyright © 2015. Published by Elsevier B.V.
Detection of target phonemes in spontaneous and read speech.
Mehta, G; Cutler, A
1988-01-01
Although spontaneous speech occurs more frequently in most listeners' experience than read speech, laboratory studies of human speech recognition typically use carefully controlled materials read from a script. The phonological and prosodic characteristics of spontaneous and read speech differ considerably, however, which suggests that laboratory results may not generalise to the recognition of spontaneous speech. In the present study listeners were presented with both spontaneous and read speech materials, and their response time to detect word-initial target phonemes was measured. Responses were, overall, equally fast in each speech mode. However, analysis of effects previously reported in phoneme detection studies revealed significant differences between speech modes. In read speech but not in spontaneous speech, later targets were detected more rapidly than earlier targets, and targets preceded by long words were detected more rapidly than targets preceded by short words. In contrast, in spontaneous speech but not in read speech, targets were detected more rapidly in accented than in unaccented words and in strong than in weak syllables. An explanation for this pattern is offered in terms of characteristic prosodic differences between spontaneous and read speech. The results support claims from previous work that listeners pay great attention to prosodic information in the process of recognising speech.
Finding intonational boundaries using acoustic cues related to the voice source
NASA Astrophysics Data System (ADS)
Choi, Jeung-Yoon; Hasegawa-Johnson, Mark; Cole, Jennifer
2005-10-01
Acoustic cues related to the voice source, including harmonic structure and spectral tilt, were examined for relevance to prosodic boundary detection. The measurements considered here comprise five categories: duration, pitch, harmonic structure, spectral tilt, and amplitude. Distributions of the measurements and statistical analysis show that the measurements may be used to differentiate between prosodic categories. Detection experiments on the Boston University Radio Speech Corpus show equal error detection rates around 70% for accent and boundary detection, using only the acoustic measurements described, without any lexical or syntactic information. Further investigation of the detection results shows that duration and amplitude measurements, and, to a lesser degree, pitch measurements, are useful for detecting accents, while all voice source measurements except pitch measurements are useful for boundary detection.
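The abstract above reports results at an equal error operating point. As a rough illustration, the sketch below sweeps a decision threshold over detector scores and picks the point where the miss rate and the false-alarm rate are closest. The scores and labels are invented toy values, and the function name is a placeholder; how the authors actually computed their figure is not stated in the abstract.

```python
def equal_error_point(scores, labels):
    """Return (threshold, miss_rate, false_alarm_rate) where the two rates are closest.

    scores: detector outputs, higher means "boundary/accent present"
    labels: 1 for a true event, 0 otherwise
    """
    positives = sum(labels)
    negatives = len(labels) - positives
    best = None
    for threshold in sorted(set(scores)):
        # A true event scoring below the threshold is a miss.
        misses = sum(1 for s, y in zip(scores, labels) if y == 1 and s < threshold)
        # A non-event scoring at or above the threshold is a false alarm.
        false_alarms = sum(1 for s, y in zip(scores, labels) if y == 0 and s >= threshold)
        miss_rate = misses / positives
        fa_rate = false_alarms / negatives
        gap = abs(miss_rate - fa_rate)
        if best is None or gap < best[0]:
            best = (gap, threshold, miss_rate, fa_rate)
    return best[1:]

# Toy data, invented for illustration only.
scores = [0.9, 0.8, 0.7, 0.6, 0.55, 0.4, 0.3, 0.2]
labels = [1, 1, 0, 1, 0, 1, 0, 0]
threshold, miss_rate, fa_rate = equal_error_point(scores, labels)
detection_rate = 1.0 - miss_rate
```

On this toy data the two error rates meet at a threshold of 0.6, where both equal 0.25 and the detection rate is 0.75.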
Automatic measurement and representation of prosodic features
NASA Astrophysics Data System (ADS)
Ying, Goangshiuan Shawn
Effective measurement and representation of prosodic features of the acoustic signal for use in automatic speech recognition and understanding systems is the goal of this work. Prosodic features (stress, duration, and intonation) are variations of the acoustic signal whose domains are beyond the boundaries of each individual phonetic segment. Listeners perceive prosodic features through a complex combination of acoustic correlates such as intensity, duration, and fundamental frequency (F0). We have developed new tools to measure F0 and intensity features. We apply a probabilistic global error correction routine to an Average Magnitude Difference Function (AMDF) pitch detector. A new short-term frequency-domain Teager energy algorithm is used to measure the energy of a speech signal. We have conducted a series of experiments performing lexical stress detection on words in continuous English speech from two speech corpora. We have experimented with two different approaches, a segment-based approach and a rhythm unit-based approach, in lexical stress detection. The first approach uses pattern recognition with energy- and duration-based measurements as features to build Bayesian classifiers to detect the stress level of a vowel segment. In the second approach we define a rhythm unit and use only the F0-based measurement and a scoring system to determine the stressed segment in the rhythm unit. A duration-based segmentation routine was developed to break polysyllabic words into rhythm units. The long-term goal of this work is to develop a system that can effectively detect the stress pattern for each word in continuous speech utterances. Stress information will be integrated as a constraint for pruning the word hypotheses in a word recognition system based on hidden Markov models.
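To make the two signal measurements named in the abstract above more concrete, here is a minimal sketch of a textbook AMDF pitch estimator and the classic time-domain Teager energy operator. This is not the author's implementation: the probabilistic global error correction and the frequency-domain Teager variant are not reproduced, and the search range, tolerance, and test signal are assumptions made for illustration.

```python
import math

def amdf(frame, lag):
    """Average magnitude difference between a frame and itself delayed by `lag` samples."""
    n = len(frame) - lag
    return sum(abs(frame[i] - frame[i + lag]) for i in range(n)) / n

def estimate_f0(frame, sample_rate, f0_min=60.0, f0_max=400.0, tol=1e-3):
    """Estimate F0 (Hz) as sample_rate / lag for the shortest lag at the AMDF minimum."""
    min_lag = int(sample_rate / f0_max)
    max_lag = int(sample_rate / f0_min)
    scores = {lag: amdf(frame, lag) for lag in range(min_lag, max_lag + 1)}
    floor = min(scores.values())
    # Multiples of the true period also give near-zero AMDF; taking the
    # shortest lag within `tol` of the minimum avoids octave-down errors.
    best_lag = min(lag for lag, score in scores.items() if score <= floor + tol)
    return sample_rate / best_lag

def teager_energy(frame):
    """Time-domain Teager operator: psi[n] = x[n]^2 - x[n-1] * x[n+1]."""
    return [frame[i] ** 2 - frame[i - 1] * frame[i + 1] for i in range(1, len(frame) - 1)]

# Synthetic test signal (an assumption): a 200 Hz sine sampled at 8 kHz.
sample_rate = 8000
frame = [math.sin(2 * math.pi * 200 * t / sample_rate) for t in range(400)]
f0 = estimate_f0(frame, sample_rate)
energies = teager_energy(frame)
```

For a pure sinusoid the Teager energy is constant and equals the squared sine of the per-sample phase step, which is why the operator is often read as tracking both amplitude and frequency. Real speech would need framing, voicing decisions, and smoothing that this sketch leaves out.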
Baumann, Stefan; Schumacher, Petra B
2012-09-01
The paper reports on a perception experiment in German that investigated the neuro-cognitive processing of information structural concepts and their prosodic marking using event-related brain potentials (ERPs). Experimental conditions controlled the information status (given vs. new) of referring and non-referring target expressions (nouns vs. adjectives) and were elicited via context sentences, which did not, unlike most previous ERP studies in the field, trigger an explicit focus expectation. Target utterances displayed prosodic realizations of the critical words which differed in accent position and accent type. Electrophysiological results showed an effect of information status, maximally distributed over posterior sites, displaying a biphasic N400–Late Positivity pattern for new information. We claim that this pattern reflects increased processing demands associated with new information, with the N400 indicating enhanced costs from linking information with the previous discourse and the Late Positivity indicating the listener's effort to update his/her discourse model. The prosodic manipulation registered more pronounced effects over anterior regions and revealed an enhanced negativity followed by a Late Positivity for deaccentuation, probably also reflecting costs from discourse linking and updating respectively. The data further lend indirect support for the idea that givenness applies not only to referents but also to non-referential expressions ('lexical givenness').
Coordination of Prosodic Gestures at Boundaries in Greek
ERIC Educational Resources Information Center
Katsika, Argyro
2012-01-01
This dissertation investigates how boundary temporal and tonal events are coordinated to oral constrictions in Greek. Regarding the temporal events, most studies agree that boundary lengthening is cumulative (i.e., larger the stronger the boundary) (e.g., Cho & Keating 2001, Tabain 2003b) and progressive (i.e., decreasing with distance from…
Language and music phrase boundary processing in Autism Spectrum Disorder: An ERP study.
DePriest, John; Glushko, Anastasia; Steinhauer, Karsten; Koelsch, Stefan
2017-10-31
Autism spectrum disorder (ASD) is frequently associated with communicative impairment, regardless of intelligence level or mental age. Impairment of prosodic processing in particular is a common feature of ASD. Despite extensive overlap in neural resources involved in prosody and music processing, music perception seems to be spared in this population. The present study is the first to investigate prosodic phrasing in ASD in both language and music, combining event-related brain potential (ERP) and behavioral methods. We tested phrase boundary processing in language and music in neuro-typical adults and high-functioning individuals with ASD. We targeted an ERP response associated with phrase boundary processing in both language and music - i.e., the Closure Positive Shift (CPS). While a language-CPS was observed in the neuro-typical group, for ASD participants a smaller response failed to reach statistical significance. In music, we found a boundary-onset music-CPS for both groups during pauses between musical phrases. Our results support the view of preserved processing of musical cues in ASD individuals, with a corresponding prosodic impairment. This suggests that, despite the existence of a domain-general processing mechanism (the CPS), key differences in the integration of features of language and music may lead to the prosodic impairment in ASD.
Witteman, Jurriaan; van Ijzendoorn, Marinus H; van de Velde, Daan; van Heuven, Vincent J J P; Schiller, Niels O
2011-11-01
It is unclear whether there is hemispheric specialization for prosodic perception and, if so, what the nature of this hemispheric asymmetry is. Using the lesion-approach, many studies have attempted to test whether there is hemispheric specialization for emotional and linguistic prosodic perception by examining the impact of left vs. right hemispheric damage on prosodic perception task performance. However, so far no consensus has been reached. In an attempt to find a consistent pattern of lateralization for prosodic perception, a meta-analysis was performed on 38 lesion studies (including 450 left hemisphere damaged patients, 534 right hemisphere damaged patients and 491 controls) of prosodic perception. It was found that both left and right hemispheric damage compromise emotional and linguistic prosodic perception task performance. Furthermore, right hemispheric damage degraded emotional prosodic perception more than left hemispheric damage (trimmed g=-0.37, 95% CI [-0.66; -0.09], N=620 patients). It is concluded that prosodic perception is under bihemispheric control with relative specialization of the right hemisphere for emotional prosodic perception. Copyright © 2011 Elsevier Ltd. All rights reserved.
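The effect size quoted in the abstract above is a standardized mean difference. Assuming that "g" denotes Hedges' g (the abstract does not define it), the sketch below shows the usual computation for two groups: Cohen's d on the pooled standard deviation, multiplied by a small-sample correction factor. The group means, standard deviations, and sizes are invented, and the trimming step behind the "trimmed g" is not reproduced.

```python
import math

def hedges_g(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Standardized mean difference (group A minus group B) with small-sample correction."""
    pooled_sd = math.sqrt(
        ((n_a - 1) * sd_a ** 2 + (n_b - 1) * sd_b ** 2) / (n_a + n_b - 2)
    )
    d = (mean_a - mean_b) / pooled_sd          # Cohen's d
    correction = 1 - 3 / (4 * (n_a + n_b) - 9)  # common approximation of the correction factor J
    return d * correction

# Invented example: task accuracy (%) for two hypothetical patient groups.
g = hedges_g(mean_a=70.0, sd_a=10.0, n_a=20, mean_b=75.0, sd_b=10.0, n_b=20)
```

A negative value here means group A scored lower than group B, matching the sign convention in which a negative g indicates worse performance in the first-named group.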
Glushko, Anastasia; Steinhauer, Karsten; DePriest, John; Koelsch, Stefan
2016-01-01
The processing of prosodic phrase boundaries in language is immediately reflected by a specific event-related potential component called the Closure Positive Shift (CPS). A component somewhat reminiscent of the CPS in language has also been reported for musical phrases (i.e., the so-called 'music CPS'). However, in previous studies the quantification of the music-CPS as well as its morphology and timing differed substantially from the characteristics of the language-CPS. Therefore, the degree of correspondence between cognitive mechanisms of phrasing in music and in language has remained questionable. Here, we probed the shared nature of mechanisms underlying musical and prosodic phrasing by (1) investigating whether the music-CPS is present at phrase boundary positions where the language-CPS has been originally reported (i.e., at the onset of the pause between phrases), and (2) comparing the CPS in music and in language in non-musicians and professional musicians. For the first time, we report a positive shift at the onset of musical phrase boundaries that strongly resembles the language-CPS and argue that the post-boundary 'music-CPS' of previous studies may be an entirely distinct ERP component. Moreover, the language-CPS in musicians was found to be less prominent than in non-musicians, suggesting more efficient processing of prosodic phrases in language as a result of higher musical expertise. PMID: 27192560
Prosody and alignment: a sequential perspective
NASA Astrophysics Data System (ADS)
Szczepek Reed, Beatrice
2010-12-01
In their analysis of a corpus of classroom interactions in an inner city high school, Roth and Tobin describe how teachers and students accomplish interactional alignment by prosodically matching each other's turns. Prosodic matching and specific prosodic patterns are interpreted as signs of, and contributions to, successful interactional outcomes and positive emotions. Lack of prosodic matching and other specific prosodic patterns are interpreted as features of unsuccessful interactions and negative emotions. This forum focuses on the article's analysis of the relation between interpersonal alignment, emotion and prosody. It argues that prosodic matching, and other prosodic linking practices, play a primarily sequential role, i.e. one that displays the way in which participants place and design their turns in relation to other participants' turns. Prosodic matching, rather than being a conversational action in itself, is argued to be an interactional practice (Schegloff 1997), which is not always employed for the accomplishment of 'positive', or aligning actions.
Prosodic Similarity Effects in Short-Term Memory in Developmental Dyslexia.
Goswami, Usha; Barnes, Lisa; Mead, Natasha; Power, Alan James; Leong, Victoria
2016-11-01
Children with developmental dyslexia are characterized by phonological difficulties across languages. Classically, this 'phonological deficit' in dyslexia has been investigated with tasks using single-syllable words. Recently, however, several studies have demonstrated difficulties in prosodic awareness in dyslexia. Potential prosodic effects in short-term memory have not yet been investigated. Here we create a new instrument based on three-syllable words that vary in stress patterns, to investigate whether prosodic similarity (the same prosodic pattern of stressed and unstressed syllables) exerts systematic effects on short-term memory. We study participants with dyslexia and age-matched and younger reading-level-matched typically developing controls. We find that all participants, including dyslexic participants, show prosodic similarity effects in short-term memory. All participants exhibited better retention of words that differed in prosodic structure, although participants with dyslexia recalled fewer words accurately overall compared to age-matched controls. Individual differences in prosodic memory were predicted by earlier vocabulary abilities, by earlier sensitivity to syllable stress and by earlier phonological awareness. To our knowledge, this is the first demonstration of prosodic similarity effects in short-term memory. The implications of a prosodic similarity effect for theories of lexical representation and of dyslexia are discussed. © 2016 The Authors. Dyslexia published by John Wiley & Sons Ltd.
ERIC Educational Resources Information Center
Goswami, Usha; Gerson, Danielle; Astruc, Luisa
2010-01-01
Here we explore relations between auditory perception of amplitude envelope structure, prosodic sensitivity, and phonological awareness in a sample of 56 typically-developing children and children with developmental dyslexia. We examine whether rise time sensitivity is linked to prosodic sensitivity, and whether prosodic sensitivity is linked to…
ERIC Educational Resources Information Center
Kehoe, Margaret; Stoel-Gammon, Carol
1997-01-01
Examines different approaches to prosodic acquisition: Gerken's S(W) production template; Fikkert's and Archibald's theories of stress acquisition and Demuth and Fee's prosodic hierarchy account. Results reveal that current approaches cannot account for findings in the data such as the increased preservation of final over nonfinal unstressed…
Breen, Mara; Kaswer, Lianne; Van Dyke, Julie A.; Krivokapić, Jelena; Landi, Nicole
2016-01-01
Researchers have established a relationship between beginning readers' silent comprehension ability and their prosodic fluency, such that readers who read aloud with appropriate prosody tend to have higher scores on silent reading comprehension assessments. The current study was designed to investigate this relationship in two groups of high school readers: Specifically Poor Comprehenders (SPCs), who have adequate word level and phonological skills but poor reading comprehension ability, and a group of age- and decoding skill-matched controls. We compared the prosodic fluency of the two groups by determining how effectively they produced prosodic cues to syntactic and semantic structure in imitations of a model speaker's production of syntactically and semantically varied sentences. Analyses of pitch and duration patterns revealed that speakers in both groups produced the expected prosodic patterns; however, controls provided stronger durational cues to syntactic structure. These results demonstrate that the relationship between prosodic fluency and reading comprehension continues past the stage of early reading instruction. Moreover, they suggest that prosodically fluent speakers may also generate more fluent implicit prosodic representations during silent reading, leading to more effective comprehension. PMID:27486409
Bögels, Sara; Schriefers, Herbert; Vonk, Wietske; Chwilla, Dorothee J; Kerkhofs, Roel
2013-11-01
This ERP study investigates whether a superfluous prosodic break (i.e., a prosodic break that does not coincide with a syntactic break) has more severe processing consequences during auditory sentence comprehension than a missing prosodic break (i.e., the absence of a prosodic break at the position of a syntactic break). Participants listened to temporarily ambiguous sentences involving a prosody-syntax match or mismatch. The disambiguation of these sentences was always lexical in nature in the present experiment. This contrasts with a related study by Pauker, Itzhak, Baum, and Steinhauer (2011), where the disambiguation was of a lexical type for missing PBs and of a prosodic type for superfluous PBs. Our results converge with those of Pauker et al. (2011): superfluous prosodic breaks lead to more severe processing problems than missing prosodic breaks. Importantly, the present results extend those of Pauker et al. (2011) showing that this holds when the disambiguation is always lexical in nature. Furthermore, our results show that the way listeners use prosody can change over the course of the experiment which bears consequences for future studies. © 2013 Elsevier Ltd. All rights reserved.
Lin, Chin-Teng; Wu, Rui-Cheng; Chang, Jyh-Yeong; Liang, Sheng-Fu
2004-02-01
In this paper, a new technique for the Chinese text-to-speech (TTS) system is proposed. Our major effort focuses on the generation of prosodic information. New methodologies are developed for constructing fuzzy rules in a prosodic model that simulates human pronunciation rules. The proposed Recurrent Fuzzy Neural Network (RFNN) is a multilayer recurrent neural network (RNN) which integrates a Self-cOnstructing Neural Fuzzy Inference Network (SONFIN) into a recurrent connectionist structure. The RFNN can be functionally divided into two parts. The first part adopts the SONFIN as a prosodic model to explore the relationship between high-level linguistic features and prosodic information based on fuzzy inference rules. Compared to conventional neural networks, the SONFIN can always construct itself with an economical network size and at high learning speed. The second part employs a five-layer network to generate all prosodic parameters by directly using the prosodic fuzzy rules inferred from the first part as well as other important features of syllables. The TTS system combined with the proposed method can reproduce not only sandhi rules but also the other prosodic phenomena present in traditional TTS systems. Moreover, the proposed scheme can even discover some new rules about prosodic phrase structure. The performance of the proposed RFNN-based prosodic model is verified by embedding it into a Chinese TTS system with a Chinese monosyllable database based on the time-domain pitch synchronous overlap add (TD-PSOLA) method. Our experimental results show that the proposed RFNN can generate proper prosodic parameters including pitch means, pitch shapes, maximum energy levels, syllable duration, and pause duration. Some synthetic sounds are available online for demonstration.
Processing of affective speech prosody is impaired in Asperger syndrome.
Korpilahti, Pirjo; Jansson-Verkasalo, Eira; Mattila, Marja-Leena; Kuusikko, Sanna; Suominen, Kalervo; Rytky, Seppo; Pauls, David L; Moilanen, Irma
2007-09-01
Many people with the diagnosis of Asperger syndrome (AS) show poorly developed skills in understanding emotional messages. The present study addressed discrimination of speech prosody in children with AS at the neurophysiological level. Detection of affective prosody was investigated in one-word utterances as indexed by the N1 and the mismatch negativity (MMN) of auditory event-related potentials (ERPs). Data from fourteen boys with AS were compared with those for thirteen typically developed boys. These results suggest atypical neural responses to affective prosody in children with AS and their fathers, especially over the RH, and that this impairment can already be seen at low-level information processes. Our results provide evidence for familial patterns of abnormal auditory brain reactions to prosodic features of speech.
Processing of prosodic changes in natural speech stimuli in school-age children.
Lindström, R; Lepistö, T; Makkonen, T; Kujala, T
2012-12-01
Speech prosody conveys information about important aspects of communication: the meaning of the sentence and the emotional state or intention of the speaker. The present study addressed processing of emotional prosodic changes in natural speech stimuli in school-age children (mean age 10 years) by recording the electroencephalogram, facial electromyography, and behavioral responses. The stimulus was a semantically neutral Finnish word uttered with four different emotional connotations: neutral, commanding, sad, and scornful. In the behavioral sound-discrimination task the reaction times were fastest for the commanding stimulus and longest for the scornful stimulus, and faster for the neutral than for the sad stimulus. EEG and EMG responses were measured during a non-attentive oddball paradigm. Prosodic changes elicited a negative-going, fronto-centrally distributed neural response peaking at about 500 ms from the onset of the stimulus, followed by a fronto-central positive deflection, peaking at about 740 ms. For the commanding stimulus, a rapid negative deflection peaking at about 290 ms from stimulus onset was also elicited. No reliable stimulus-type-specific rapid facial reactions were found. The results show that prosodic changes in natural speech stimuli activate pre-attentive neural change-detection mechanisms in school-age children. However, the results do not support the suggestion of automaticity of emotion-specific facial muscle responses to non-attended emotional speech stimuli in children. Copyright © 2012 Elsevier B.V. All rights reserved.
ERIC Educational Resources Information Center
Nickels, Stefanie; Steinhauer, Karsten
2018-01-01
The role of prosodic information in sentence processing is not usually addressed in second language (L2) instruction, and neurocognitive studies on prosody-syntax interactions are rare. Here we compare event-related potentials (ERP) of Chinese and German learners of English L2 to those of native English speakers and show how first language (L1)…
Syntax-Prosody Interface: Evidence from wh-Movement in Jordanian Arabic and Egyptian Arabic
ERIC Educational Resources Information Center
Yasin, Ayman
2012-01-01
Richards (2006, 2010) suggests that wh-movement is prosodically driven. Based on the position of the Comp(lementizer) and the marking of prosodic phrase edges, he claims that if Comp is on one side and the language marks the opposite side of prosodic phrases, then the wh-phrase does not move since Comp and wh-phrase can create a prosodic wh-domain…
Kim, Sahyang; Cho, Taehong
2009-05-01
This study investigated the role of phrase-level prosodic boundary information in word segmentation in Korean with two word-spotting experiments. In experiment 1, it was found that intonational cues alone helped listeners with lexical segmentation. Listeners paid more attention to local intonational cues (...H#L...) across the prosodic boundary than the intonational information within a prosodic phrase. The results imply that intonation patterns with high frequency are used, though not exclusively, in lexical segmentation. In experiment 2, final lengthening was added to see how multiple prosodic cues influence lexical segmentation. The results showed that listeners did not necessarily benefit from the presence of both intonational and final lengthening cues: Their performance was improved only when intonational information contained infrequent tonal patterns for boundary marking, showing only partially cumulative effects of prosodic cues. When the intonational information was optimal (frequent) for boundary marking, however, poorer performance was observed with final lengthening. This is arguably because the phrase-initial segmental allophonic cues for the accentual phrase were not matched with the prosodic cues for the intonational phrase. It is proposed that the asymmetrical use of multiple cues was due to interaction between prosodic and segmental information that are computed in parallel in lexical segmentation.
Acoustic Correlates of Focus Marking in Czech and Polish.
Hamlaoui, Fatima; Żygis, Marzena; Engelmann, Jonas; Wagner, Michael
2018-05-01
Languages vary in the type of contexts that affect prosodic prominence. This paper reports on a production study investigating how different types of foci influence prosody in Polish and Czech noun phrases. The results show that in both languages, focus and givenness are marked prosodically, with pitch and intensity as the main acoustic correlates. Like Germanic languages, Polish and Czech patterns show prosodic focus marking in a broad range of contexts and differ in this regard from other fixed-word-stress languages such as French. This suggests that (a) Polish and Czech are similar to Germanic languages and are unlike Romance languages in marking a variety of types of focus prosodically; (b) there is no close correlation between fixed word stress and lack of prosodic focus marking because Polish, which has fixed stress on the penult, shows prosodic focus marking for all types of focus; and (c) there is no straightforward relationship between flexible word order and whether focus and givenness are prosodically marked, contrary to earlier claims, because both Czech and Polish, with their relatively flexible word order, are more similar to English than Romance languages.
Utilization of Prosodic Information in Syntactic Ambiguity Resolution
2010-01-01
Two self-paced listening experiments examined the role of prosodic phrasing in syntactic ambiguity resolution. In Experiment 1, the stimuli consisted of early closure sentences (e.g., “While the parents watched, the child sang a song.”) containing transitive-biased subordinate verbs paired with plausible direct objects or intransitive-biased subordinate verbs paired with implausible direct objects. Experiment 2 also contained early closure sentences with transitive- and intransitive-biased subordinate verbs, but the subordinate verbs were always followed by plausible direct objects. In both experiments, there were two prosodic conditions. In the subject-biased prosodic condition, an intonational phrase boundary marked the clausal boundary following the subordinate verb. In the object-biased prosodic condition, the clause boundary was unmarked. The results indicate that lexical and prosodic cues interact at the subordinate verb and that plausibility further affects processing at the ambiguous noun. Results are discussed with respect to models of the role of prosody in sentence comprehension. PMID:20033849
Prosodic development in middle childhood and adolescence in high-functioning autism.
Lyons, Megan; Schoen Simmons, Elizabeth; Paul, Rhea
2014-04-01
The present study aims to investigate the perception and production of several domains of prosodic performance in a cross-sectional sample of preadolescents and adolescents with and without high-functioning autism (HFA). To look at the role of language abilities on prosodic performance, the HFA groups were subdivided based on "high" and "low" language performance on the Clinical Evaluation of Language Fundamentals-Fourth Edition (CELF-4) (Semel, Wiig, & Secord). Social and cognitive abilities were also examined to determine their relationship to prosodic performance. No significant differences were seen in prosody scores in the younger versus older subgroups in the typically developing (TD) group with age-appropriate language. There was a small but significant improvement in performance with age in the groups with HFA. Comparing performance at each age level across diagnostic groups showed that preteens with HFA and higher language levels perform similarly to their TD peers on all prosodic tasks, whereas those with lower language skills scored significantly worse than both their higher language and TD peers when looking at composite perception and production findings. Teens with HFA showed no deficits on perception tasks; however, those with low language levels had difficulty on several production tasks when compared to the TD group. Regression analyses suggested that, for the preteen group with HFA, language was the strongest predictor of prosodic perception, whereas nonverbal IQ was most highly predictive of prosodic production. For adolescents with HFA, social skills significantly contributed to the prediction of prosodic perception and, along with language abilities, predicted prosodic production. Implications of these findings will be discussed. © 2014 International Society for Autism Research, Wiley Periodicals, Inc.
ERIC Educational Resources Information Center
Mietz, Anja; Toepel, Ulrike; Ischebeck, Anja; Alter, Kai
2008-01-01
The current study on German investigates Event-Related brain Potentials (ERPs) for the perception of sentences with intonations which are infrequent (i.e. vocatives) or inadequate in daily conversation. These ERPs are compared to the processing correlates for sentences in which the syntax-to-prosody relations are congruent and used frequently…
Effects of prosodic boundary on /aC/ sequences: articulatory results
NASA Astrophysics Data System (ADS)
Tabain, Marija
2003-05-01
This study presents EMA (electromagnetic articulography) data on articulation of the vowel /a/ at different prosodic boundaries in French. Three speakers of metropolitan French produced utterances containing the vowel /a/, preceded by /t/ and followed by one of six consonants /b d g f s ʃ/ (three stops and three fricatives), with different prosodic boundaries intervening between the /a/ and the six different consonants. The prosodic boundaries investigated are the Utterance, the Intonational phrase, the Accentual phrase, and the Word. Data for the Tongue Tip, Tongue Body, and Jaw are presented. The articulatory data presented here were recorded at the same time as the acoustic data presented in Tabain [J. Acoust. Soc. Am. 113, 516-531 (2003)]. Analyses show that there is a strong effect on peak displacement of the vowel according to the prosodic hierarchy, with the stronger prosodic boundaries inducing a much lower Tongue Body and Jaw position than the weaker prosodic boundaries. Durations of both the opening movement into and the closing movement out of the vowel are also affected. Peak velocity of the articulatory movements is also examined, and, contrary to results for phrase-final lengthening, it is found that peak velocity of the opening movement into the vowel tends to increase with the higher prosodic boundaries, together with the increased magnitude of the movement between the consonant and the vowel. Results for the closing movement out of the vowel and into the consonant are not so clear. Since one speaker shows evidence of utterance-level articulatory declension, it is suggested that the competing constraints of articulatory declension and prosodic effects might explain some previous results on phrase-final lengthening.
The Prosodic Basis of the Tiberian Hebrew System of Accents.
ERIC Educational Resources Information Center
Dresher, Bezalel Elan
1994-01-01
It is argued that the Tiberian system of accents that annotate the text of the Hebrew Bible has a prosodic basis. Tiberian representation can best be understood by integrating results of phonological, phonetic, and psycholinguistic research on prosodic structure. (93 references) (Author/LB)
Cohen, Alex S; Hong, S Lee; Guevara, Alvaro
2010-06-01
Emotional expression is an essential function for daily life that can be severely affected in some psychological disorders. Laboratory-based procedures designed to measure prosodic expression from natural speech have shown early promise for measuring individual differences in emotional expression but have yet to produce robust within-group prosodic changes across various evocative conditions. This report presents data from three separate studies (total N = 464) that digitally recorded subjects as they verbalized their reactions to various stimuli. Format and stimuli were modified to maximize prosodic expression. Our results suggest that use of evocative slides organized according to either a dimensional (e.g., high and low arousal - pleasant, unpleasant and neutral valence) or a categorical (e.g., fear, surprise, happiness) model produced robust changes in subjective state but only negligible change in prosodic expression. Alternatively, speech from the recall of autobiographical memories resulted in meaningful changes in both subjective state and prosodic expression. Implications for the study of psychological disorders are discussed.
Prosody and Alignment: A Sequential Perspective
ERIC Educational Resources Information Center
Reed, Beatrice Szczepek
2010-01-01
In their analysis of a corpus of classroom interactions in an inner city high school, Roth and Tobin describe how teachers and students accomplish interactional alignment by prosodically matching each other's turns. Prosodic matching, and specific prosodic patterns are interpreted as signs of, and contributions to successful interactional outcomes…
Effects of Participant Engagement on Prosodic Prominence
ERIC Educational Resources Information Center
Buxó-Lugo, Andrés; Toscano, Joseph C.; Watson, Duane G.
2018-01-01
It is generally assumed that prosodic cues that provide linguistic information, like discourse status, are driven primarily by the information structure of the conversation. This article investigates whether speakers have the capacity to adjust subtle acoustic-phonetic properties of the prosodic signal when they find themselves in contexts in…
Palatalization and Intrinsic Prosodic Vowel Features in Russian
ERIC Educational Resources Information Center
Ordin, Mikhail
2011-01-01
The presented study is aimed at investigating the interaction of palatalization and intrinsic prosodic features of the vowel in CVC (consonant+vowel+consonant) syllables in Russian. The universal nature of intrinsic prosodic vowel features was confirmed with the data from the Russian language. It was found that palatalization of the consonants…
Prosodic Perception Problems in Spanish Dyslexia
ERIC Educational Resources Information Center
Cuetos, Fernando; Martínez-García, Cristina; Suárez-Coalla, Paz
2018-01-01
The aim of this study was to investigate prosodic abilities, in addition to phonological and visual abilities, in children with dyslexia in Spanish, which can be considered a syllable-timed language. The performances on prosodic tasks (prosodic perception, rise-time perception), phonological tasks (phonological awareness, rapid naming, verbal working…
Modeling the Relationship between Prosodic Sensitivity and Early Literacy
ERIC Educational Resources Information Center
Holliman, Andrew; Critten, Sarah; Lawrence, Tony; Harrison, Emily; Wood, Clare; Hughes, David
2014-01-01
A growing literature has demonstrated that prosodic sensitivity is related to early literacy development; however, the precise nature of this relationship remains unclear. It has been speculated in recent theoretical models that the observed relationship between prosodic sensitivity and early literacy might be partially mediated by children's…
Loutrari, Ariadne; Tselekidou, Freideriki; Proios, Hariklia
2018-02-27
Prosodic patterns of speech appear to make a critical contribution to memory-related processing. We considered the case of a previously unexplored prosodic feature of Greek storytelling and its effect on free recall in thirty typically developing children between the ages of 10 and 12 years, using short ecologically valid auditory stimuli. The combination of a falling pitch contour and, more notably, extensive final-syllable vowel lengthening, which gives rise to the prosodic feature in question, led to statistically significantly higher performance in comparison to neutral phrase-final prosody. Number of syllables in target words did not reveal substantial difference in performance. The current study presents a previously undocumented culturally-specific prosodic pattern and its effect on short-term memory.
Understanding speaker attitudes from prosody by adults with Parkinson's disease.
Monetta, Laura; Cheang, Henry S; Pell, Marc D
2008-09-01
The ability to interpret vocal (prosodic) cues during social interactions can be disrupted by Parkinson's disease (PD), with notable effects on how emotions are understood from speech. This study investigated whether PD patients who have emotional prosody deficits exhibit further difficulties decoding the attitude of a speaker from prosody. Vocally inflected but semantically nonsensical 'pseudo-utterances' were presented to listener groups with and without PD in two separate rating tasks. Task 1 required participants to rate how confident a speaker sounded from their voice and Task 2 required listeners to rate how polite the speaker sounded for a comparable set of pseudo-utterances. The results showed that PD patients were significantly less able than healthy control (HC) participants to use prosodic cues to differentiate intended levels of speaker confidence in speech, although the patients could accurately detect the polite/impolite attitude of the speaker from prosody in most cases. Our data suggest that many PD patients fail to use vocal cues to effectively infer a speaker's emotions as well as certain attitudes in speech such as confidence, consistent with the idea that the basal ganglia play a role in the meaningful processing of prosodic sequences in spoken language (Pell & Leonard, 2003).
The Use of Prosodic Cues in Learning New Words in an Unfamiliar Language
ERIC Educational Resources Information Center
Kim, Sahyang; Broersma, Mirjam; Cho, Taehong
2012-01-01
The artificial language learning paradigm was used to investigate to what extent the use of prosodic features is universally applicable or specifically language driven in learning an unfamiliar language, and how nonnative prosodic patterns can be learned. Listeners of unrelated languages--Dutch (n = 100) and Korean (n = 100)--participated. The…
Contrast-Marking Prosodic Emphasis in Williams Syndrome: Results of Detailed Phonetic Analysis
ERIC Educational Resources Information Center
Ito, Kiwako; Martens, Marilee A.
2017-01-01
Background: Past reports on the speech production of individuals with Williams syndrome (WS) suggest that their prosody is anomalous and may lead to challenges in spoken communication. While existing prosodic assessments confirm that individuals with WS fail to use prosodic emphasis to express contrast, those reports typically lack detailed…
From Sound to Syntax: The Prosodic Bootstrapping of Clauses
ERIC Educational Resources Information Center
Hawthorne, Kara
2013-01-01
It has long been argued that prosodic cues may facilitate syntax acquisition (e.g., Morgan, 1986). Previous studies have shown that infants are sensitive to violations of typical correlations between clause-final prosodic cues (Hirsh-Pasek et al., 1987) and that prosody facilitates memory for strings of words (Soderstrom et al., 2005). This…
Immediate use of prosody and context in predicting a syntactic structure.
Nakamura, Chie; Arai, Manabu; Mazuka, Reiko
2012-11-01
Numerous studies have reported an effect of prosodic information on parsing but whether prosody can impact even the initial parsing decision is still not evident. In a visual world eye-tracking experiment, we investigated the influence of contrastive intonation and visual context on processing temporarily ambiguous relative clause sentences in Japanese. Our results showed that listeners used the prosodic cue to make a structural prediction before hearing disambiguating information. Importantly, the effect was limited to cases where the visual scene provided an appropriate context for the prosodic cue, thus eliminating the explanation that listeners have simply associated marked prosodic information with a less frequent structure. Furthermore, the influence of the prosodic information was also evident following disambiguating information, in a way that reflected the initial analysis. The current study demonstrates that prosody, when provided with an appropriate context, influences the initial syntactic analysis and also the subsequent cost at disambiguating information. The results also provide first evidence for pre-head structural prediction driven by prosodic and contextual information with a head-final construction. Copyright © 2012 Elsevier B.V. All rights reserved.
Prosodic Structure as a Parallel to Musical Structure
Heffner, Christopher C.; Slevc, L. Robert
2015-01-01
What structural properties do language and music share? Although early speculation identified a wide variety of possibilities, the literature has largely focused on the parallels between musical structure and syntactic structure. Here, we argue that parallels between musical structure and prosodic structure deserve more attention. We review the evidence for a link between musical and prosodic structure and find it to be strong. In fact, certain elements of prosodic structure may provide a parsimonious comparison with musical structure without sacrificing empirical findings related to the parallels between language and music. We then develop several predictions related to such a hypothesis. PMID:26733930
Phrase Lengths and the Perceived Informativeness of Prosodic Cues in Turkish.
Dinçtopal Deniz, Nazik; Fodor, Janet Dean
2017-12-01
It is known from previous studies that in many cases (though not all) the prosodic properties of a spoken utterance reflect aspects of its syntactic structure, and also that in many cases (though not all) listeners can benefit from these prosodic cues. A novel contribution to this literature is the Rational Speaker Hypothesis (RSH), proposed by Clifton, Carlson and Frazier. The RSH maintains that listeners are sensitive to possible reasons for why a speaker might introduce a prosodic break: "listeners treat a prosodic boundary as more informative about the syntax when it flanks short constituents than when it flanks longer constituents," because in the latter case the speaker might have been motivated solely by consideration of optimal phrase lengths. This would effectively reduce the cue value of an appropriately placed prosodic boundary. We present additional evidence for the RSH from Turkish, a language typologically different from English. In addition, our study shows for the first time that the RSH also applies to a prosodic break which conflicts with the syntactic structure, reducing its perceived cue strength if it might have been motivated by length considerations. In this case, the RSH effect is beneficial. Finally, the Turkish data show that prosody-based explanations for parsing preferences such as the RSH do not take the place of traditional syntax-sensitive parsing strategies such as Late Closure. The two sources of guidance co-exist; both are used when available.
Grammar and Frequency Effects in the Acquisition of Prosodic Words in European Portuguese
ERIC Educational Resources Information Center
Vigario, Marina; Freitas, Maria Joao; Frota, Sonia
2006-01-01
This paper investigates the acquisition of prosodic words in European Portuguese (EP) through analysis of grammatical and statistical properties of the target language and child speech. The analysis of grammatical properties shows that there are solid cues to the prosodic word (PW) in EP, and the presence of early word-based phonology in child…
ERIC Educational Resources Information Center
Lleo, Conxita
2006-01-01
This article examines the constraints on Prosodic Word production in Spanish by three monolingual and three Spanish-German bilingual children from the beginning of word production until 2;2. It also considers the relationship between Prosodic Words and Phonological Phrases, and in the case of monosyllabic words, it takes into consideration…
ERIC Educational Resources Information Center
Szendroi, Kriszta; Bernard, Carline; Berger, Frauke; Gervain, Judit; Hohle, Barbara
2018-01-01
Previous research on young children's knowledge of prosodic focus marking has revealed an apparent paradox, with comprehension appearing to lag behind production. Comprehension of prosodic focus is difficult to study experimentally due to its subtle and ambiguous contribution to pragmatic meaning. We designed a novel comprehension task, which…
ERIC Educational Resources Information Center
Chan, Jessica S.; Wade-Woolley, Lesly
2018-01-01
Background: This study was designed to extend our understanding of phonology and reading to include suprasegmental awareness using measures of prosodic awareness, which are complex tasks that tap into the rhythmic aspects of phonology. By requiring participants to access, reflect on and manipulate word stress, the prosodic awareness measures used…
Prosodic Awareness Is Related to Reading Ability in Children with Autism Spectrum Disorders
ERIC Educational Resources Information Center
Nash, Renae; Arciuli, Joanne
2016-01-01
Prosodic awareness has been linked with reading accuracy in typically developing children. Although children with autism spectrum disorders (ASD) often have difficulty processing prosody and often have trouble learning to read, no previous study has looked at the link between explicit prosodic awareness and reading in ASD. In the current study, 29…
Prosodic Abilities in Spanish and English Children with Williams Syndrome: A Cross-Linguistic Study
ERIC Educational Resources Information Center
Martinez-Castilla, Pastora; Stojanovik, Vesna; Setter, Jane; Sotillo, Maria
2012-01-01
The aim of this study was to compare the prosodic profiles of English- and Spanish-speaking children with Williams syndrome (WS), examining cross-linguistic differences. Two groups of children with WS, English and Spanish, of similar chronological and nonverbal mental age, were compared on performance in expressive and receptive prosodic tasks…
ERIC Educational Resources Information Center
Kim, Young-Suk Grace; Petscher, Yaacov
2016-01-01
Emerging evidence suggests that children's sensitivity to suprasegmental phonology such as stress and timing (i.e., prosodic sensitivity) contributes to reading. The primary goal of this study was to investigate pathways of the relation of prosodic sensitivity to reading (word reading and reading comprehension) using data from 370 first-grade…
Effects of gender and regional dialect on prosodic patterns in American English
Clopper, Cynthia G.; Smiljanic, Rajka
2011-01-01
While cross-dialect prosodic variation has been well established for many languages, most variationist research on regional dialects of American English has focused on the vowel system. The current study was designed to explore prosodic variation in read speech in two regional varieties of American English: Southern and Midland. Prosodic dialect variation was analyzed in two domains: speaking rate and the phonetic expression of pitch movements associated with accented and phrase-final syllables. The results revealed significant effects of regional dialect on the distributions of pauses, pitch accents, and phrasal-boundary tone combinations. Significant effects of talker gender were also observed on the distributions of pitch accents and phrasal-boundary tone combinations. The findings from this study demonstrate that regional and gender identity features are encoded in part through prosody, and provide further motivation for the close examination of prosodic patterns across regional and social varieties of American English. PMID:21686317
The Prosody of Topic Transition in Interaction: Pitch Register Variations.
Riou, Marine
2017-12-01
In conversation, speakers can mobilize a variety of prosodic cues to signal a switch in topics. This paper uses a mixed-methods approach combining Conversation Analysis and Instrumental Prosody to investigate the prosody of topic transition in American English, and analyzes the ways in which speakers can play on register level and on register span. A cluster of three prosodic parameters was found to be predictive of transitions: a higher maximum fundamental frequency (F0), a higher median F0 (key), and an expanded register span. Relative to speakers' habitual profiles, the mobilization of such prosodic cues corresponds to a marked upgraded prosodic design. This finding is consistent with the general assumption that continuation constitutes the norm in conversation, and that departing from it, as in the case of a topic transition, requires a marked action and marked linguistic design. The disjunctive action of opening a new topic corresponds to the use of a marked prosodic cue.
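As a rough illustration of the three parameters named in this abstract (maximum F0, median F0, and register span), the sketch below summarises an F0 track and compares it with a speaker baseline. The function names, the treatment of unvoiced frames as 0 or None, the semitone definition of span, and the simple "all three values exceed the baseline" rule are assumptions made for illustration only; they are not taken from the study.

```python
import math
from statistics import median


def register_profile(f0_hz):
    """Summarise an F0 track in Hz; frames given as 0 or None are treated as unvoiced."""
    voiced = [f for f in f0_hz if f and f > 0]
    if not voiced:
        raise ValueError("no voiced frames in F0 track")
    f0_max, f0_min = max(voiced), min(voiced)
    return {
        "max_f0_hz": f0_max,
        "median_f0_hz": median(voiced),  # register level ("key")
        # Register span on a semitone scale (assumed definition).
        "span_semitones": 12 * math.log2(f0_max / f0_min),
    }


def is_upgraded(unit, baseline):
    """Hypothetical rule: all three parameters are higher than the speaker's baseline."""
    keys = ("max_f0_hz", "median_f0_hz", "span_semitones")
    return all(unit[k] > baseline[k] for k in keys)


if __name__ == "__main__":
    habitual = register_profile([100, 120, 110])    # invented baseline values
    candidate = register_profile([100, 0, 200, 150])  # invented unit values
    print(candidate, is_upgraded(candidate, habitual))
```

In practice the F0 values would come from a pitch tracker, and the baseline would be estimated from a larger sample of the same speaker's speech.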
ERIC Educational Resources Information Center
Geiser, Eveline; Kjelgaard, Margaret; Christodoulou, Joanna A.; Cyr, Abigail; Gabrieli, John D. E.
2014-01-01
Reading disability in children with dyslexia has been proposed to reflect impairment in auditory timing perception. We investigated one aspect of timing perception--"temporal grouping"--as present in prosodic phrase boundaries of natural speech, in age-matched groups of children, ages 6-8 years, with and without dyslexia. Prosodic phrase…
ERIC Educational Resources Information Center
Witteman, Jurriaan; van IJzendoorn, Marinus H.; van de Velde, Daan; van Heuven, Vincent J. J. P.; Schiller, Niels O.
2011-01-01
It is unclear whether there is hemispheric specialization for prosodic perception and, if so, what the nature of this hemispheric asymmetry is. Using the lesion-approach, many studies have attempted to test whether there is hemispheric specialization for emotional and linguistic prosodic perception by examining the impact of left vs. right…
ERIC Educational Resources Information Center
Demuth, Katherine; Tomas, Ekaterina
2016-01-01
A growing body of research with typically developing children has begun to show that the acquisition of grammatical morphemes interacts not only with a developing knowledge of syntax, but also with developing abilities at the interface with prosodic phonology. In particular, a Prosodic Licensing approach to these issues provides a framework for…
Prosodic domain-initial effects on the acoustic structure of vowels
NASA Astrophysics Data System (ADS)
Fox, Robert Allen; Jacewicz, Ewa; Salmons, Joseph
2003-10-01
In the process of language change, vowels tend to shift in "chains," leading to reorganizations of entire vowel systems over time. A long research tradition has described such patterns, but little is understood about what factors motivate such shifts. Drawing data from changes in progress in American English dialects, the broad hypothesis is tested that changes in vowel systems are related to prosodic organization and stress patterns. Changes in vowels under greater prosodic prominence correlate directly with, and likely underlie, historical patterns of shift. This study examines acoustic characteristics of vowels at initial edges of prosodic domains [Fougeron and Keating, J. Acoust. Soc. Am. 101, 3728-3740 (1997)]. The investigation is restricted to three distinct prosodic levels: utterance (sentence-initial), phonological phrase (strong branch of a foot), and syllable (weak branch of a foot). The predicted changes in vowels /e/ and /ɛ/ in two American English dialects (from Ohio and Wisconsin) are examined along a set of acoustic parameters: duration, formant frequencies (including dynamic changes over time), and fundamental frequency (F0). In addition to traditional methodology which elicits list-like intonation, a design is adapted to examine prosodic patterns in more typical sentence intonations. [Work partially supported by NIDCD R03 DC005560-01.]
Lexical and Prosodic Effects on Syntactic Ambiguity Resolution in Aphasia
DeDe, Gayle
2012-01-01
The purpose of this study was to determine whether and when individuals with aphasia and healthy controls use lexical and prosodic information during on-line sentence comprehension. Individuals with aphasia and controls (n = 12 per group) participated in a self-paced listening experiment. The stimuli were early closure sentences, such as “While the parents watched(,) the child sang a song.” Both lexical and prosodic cues were manipulated. The cues were biased toward the subject- or object- of the ambiguous noun phrase (the child). Thus, there were two congruous conditions (in which both lexical cues and prosodic cues were consistent) and two incongruous conditions (in which lexical and prosodic cues conflicted). The results showed that the people with aphasia had longer listening times for the ambiguous noun phrase (the child) when the cues were conflicting, rather than consistent. The controls showed effects earlier in the sentence, at the subordinate verb (watched or danced). Both groups showed evidence of reanalysis at the main verb (sang). These effects demonstrate that the aphasic group was sensitive to the lexical and prosodic cues, but used them on a delayed time course relative to the control group. PMID:22143353
ERIC Educational Resources Information Center
Quigley, Jean; McNally, Sinéad; Lawson, Sarah
2016-01-01
Research has indicated differences in prosodic expression for infants-at-risk-of-autism spectrum disorders (ASD), and it has been proposed that caregiver speech to these infants may also be moderated prosodically. In typical development, the pitch range of maternal infant-directed speech (IDS) narrows and utterance intensity decreases with infant…
ERIC Educational Resources Information Center
Holliman, A. J.; Gutiérrez Palma, N.; Critten, S.; Wood, C.; Cunnane, H.; Pillinger, C.
2017-01-01
This study was designed to examine the independent contribution of prosodic sensitivity--the rhythmic patterning of speech--to word reading and spelling in a sample of early readers. Ninety-three English-speaking children aged 5-6 years old (M = 69.28 months, SD = 3.67) were assessed for their prosodic sensitivity, vocabulary knowledge,…
Focus prosody of telephone numbers in Tokyo Japanese.
Lee, Yong-Cheol; Nambu, Satoshi; Cho, Sunghye
2018-05-01
Using production and perception experiments, this study examined whether the prosodic structure inherent to telephone numbers in Tokyo Japanese affects the realization of focus prosody as well as its perception. It was hypothesized that prosodic marking of focus differs by position within the digit groups of phone number strings. Overall, focus prosody of telephone numbers was not clearly marked, resulting in poor identification in perception. However, a difference between positions within digit groups was identified, reflecting a prosodic structure where one position is assigned an accentual peak instead of the other. The findings suggest that, conforming to a language-specific prosodic structure, focus prosody within a language can vary under the influence of a particular linguistic environment.
Dutch and English toddlers' use of linguistic cues in predicting upcoming turn transitions
Lammertink, Imme; Casillas, Marisa; Benders, Titia; Post, Brechtje; Fikkert, Paula
2015-01-01
Adults achieve successful coordination during conversation by using prosodic and lexicosyntactic cues to predict upcoming changes in speakership. We examined the relative weight of these linguistic cues in the prediction of upcoming turn structure by toddlers learning Dutch (Experiment 1; N = 21) and British English (Experiment 2; N = 20) and adult control participants (Dutch: N = 16; English: N = 20). We tracked participants' anticipatory eye movements as they watched videos of dyadic puppet conversation. We controlled the prosodic and lexicosyntactic cues to turn completion for a subset of the utterances in each conversation to create four types of target utterances (fully incomplete, incomplete syntax, incomplete prosody, and fully complete). All participants (Dutch and English toddlers and adults) used both prosodic and lexicosyntactic cues to anticipate upcoming speaker changes, but weighed lexicosyntactic cues over prosodic ones when the two were pitted against each other. The results suggest that Dutch and English toddlers are already nearly adult-like in their use of prosodic and lexicosyntactic cues in anticipating upcoming turn transitions. PMID:25964772
Kauschke, Christina; Renner, Lena; Domahs, Ulrike
2013-08-01
Recent studies suggest that morphosyntactic difficulties may result from prosodic problems. We therefore address the interface between inflectional morphology and prosody in typically developing children (TD) and children with SLI by testing whether these groups are sensitive to prosodic constraints that guide plural formation in German. A plural elicitation task was designed consisting of 60 words and 20 pseudowords. The performance of 14 German-speaking children with SLI (mean age 7.5) was compared to age-matched controls and to younger children matched for productive vocabulary. TD children performed significantly better than children with SLI. Error analyses revealed that children with SLI produced more forms that did not meet the optimal shape of a noun plural. Beyond the fact that children with SLI have deficits in plural marking, the findings suggest that they also show reduced sensitivity to prosodic requirements. In other words, the prosodic structure of inflected words seems to be vulnerable in children with SLI.
ERIC Educational Resources Information Center
Brentari, Diane; Nadolske, Marie A.; Wolford, George
2012-01-01
In this paper the prosodic structure of American Sign Language (ASL) narratives is analyzed in deaf native signers (L1-D), hearing native signers (L1-H), and highly proficient hearing second language signers (L2-H). The results of this study show that the prosodic patterns used by these groups are associated both with their ASL language experience…
Pauker, Efrat; Itzhak, Inbal; Baum, Shari R; Steinhauer, Karsten
2011-10-01
In reading, a comma in the wrong place can cause more severe misunderstandings than the lack of a required comma. Here, we used ERPs to demonstrate that a similar effect holds for prosodic boundaries in spoken language. Participants judged the acceptability of temporarily ambiguous English "garden path" sentences whose prosodic boundaries were either in line or in conflict with the actual syntactic structure. Sentences with incongruent boundaries were accepted less than those with missing boundaries and elicited a stronger on-line brain response in ERPs (N400/P600 components). Our results support the notion that mentally deleting an overt prosodic boundary is more costly than postulating a new one and extend previous findings, suggesting an immediate role of prosody in sentence comprehension. Importantly, our study also provides new details on the profile and temporal dynamics of the closure positive shift (CPS), an ERP component assumed to reflect prosodic phrasing in speech and music in real time. We show that the CPS is reliably elicited at the onset of prosodic boundaries in English sentences and is preceded by negative components. Its early onset distinguishes the speech CPS in adults both from prosodic ERP correlates in infants and from the "music CPS" previously reported for trained musicians.
Prosodic Temporal Alignment of Co-Speech Gestures to Speech Facilitates Referent Resolution
ERIC Educational Resources Information Center
Jesse, Alexandra; Johnson, Elizabeth K.
2012-01-01
Using a referent detection paradigm, we examined whether listeners can determine the object speakers are referring to by using the temporal alignment between the motion speakers impose on objects and their labeling utterances. Stimuli were created by videotaping speakers labeling a novel creature. Without being explicitly instructed to do so,…
Aziz-Zadeh, Lisa; Sheng, Tong; Gheytanchi, Anahita
2010-01-01
Background: Prosody, the melody and intonation of speech, involves the rhythm, rate, pitch and voice quality to relay linguistic and emotional information from one individual to another. A significant component of human social communication depends upon interpreting and responding to another person's prosodic tone as well as one's own ability to produce prosodic speech. However, there has been little work on whether the perception and production of prosody share common neural processes, and if so, how these might correlate with individual differences in social ability. Methods: The aim of the present study was to determine the degree to which perception and production of prosody rely on shared neural systems. Using fMRI, neural activity during perception and production of a meaningless phrase in different prosodic intonations was measured. Regions of overlap for production and perception of prosody were found in premotor regions, in particular the left inferior frontal gyrus (IFG). Activity in these regions was further found to correlate with how high an individual scored on two different measures of affective empathy as well as a measure on prosodic production ability. Conclusions: These data indicate, for the first time, that areas that are important for prosody production may also be utilized for prosody perception, as well as other aspects of social communication and social understanding, such as aspects of empathy and prosodic ability. PMID:20098696
Gleaning Structure from Sound: The Role of Prosodic Contrast in Learning Non-Adjacent Dependencies
ERIC Educational Resources Information Center
Grama, Ileana C.; Kerkhoff, Annemarie; Wijnen, Frank
2016-01-01
The ability to detect non-adjacent dependencies (i.e. between "a" and "b" in "aXb") in spoken input may support the acquisition of morpho-syntactic dependencies (e.g. "The princess 'is' kiss'ing' the frog"). Functional morphemes in morpho-syntactic dependencies are often marked by perceptual cues that render…
ERIC Educational Resources Information Center
Ari, Omer
2009-01-01
Fluency instruction has had limited effects on reading comprehension relative to reading rate and prosodic reading (Dowhower, 1987; Herman, 1985; National Institute of Child Health and Human Development, 2000a). More specific components (i.e., error detection) of comprehension may yield larger effects through exposure to a wider range of materials…
Stepanov, Arthur; Pavlič, Matic; Stateva, Penka; Reboul, Anne
2018-01-01
This study investigated whether early bilingualism and early musical training positively influence the ability to discriminate between prosodic patterns corresponding to different syntactic structures in otherwise phonetically identical sentences in an unknown language. In a same-different discrimination task, participants (N = 108) divided into four groups (monolingual non-musicians, monolingual musicians, bilingual non-musicians, and bilingual musicians) listened to pairs of short sentences in a language unknown to them (French). In discriminating phonetically identical but prosodically different sentences, musicians, bilinguals, and bilingual musicians outperformed the controls. However, there was no interaction between bilingualism and musical training to suggest an additive effect. These results underscore the significant role of both types of experience in enhancing the listeners' sensitivity to prosodic information.
Pursuing prosody interventions.
Hargrove, Patricia M
2013-08-01
This paper provides an overview of evidence-based prosodic intervention strategies to facilitate clinicians' inclusion of prosody in their therapeutic planning and to encourage researchers' interest in prosody as an area of specialization. Four current evidence-based prosodic interventions are reviewed and answers to some important clinical questions are proposed. Additionally, the future direction of prosodic intervention research is discussed in recommendations about issues that are of concern to clinicians. The paper ends with a call for participation in an online collaboration at the Clinical Prosody blog at clinicalprosody.wordpress.com.
Janssen, Simone; Schmidt, Sabine
2009-07-01
The perception of prosodic cues in human speech may be rooted in mechanisms common to mammals. The present study explores to what extent bats use rhythm and frequency, typically carrying prosodic information in human speech, for the classification of communication call series. Using a two-alternative, forced choice procedure, we trained Megaderma lyra to discriminate between synthetic contact call series differing in frequency, rhythm on level of calls and rhythm on level of call series, and measured the classification performance for stimuli differing in only one, or two, of the above parameters. A comparison with predictions from models based on one, combinations of two, or all, parameters revealed that the bats based their decision predominantly on frequency and in addition on rhythm on the level of call series, whereas rhythm on level of calls was not taken into account in this paradigm. Moreover, frequency and rhythm on the level of call series were evaluated independently. Our results show that parameters corresponding to prosodic cues in human languages are perceived and evaluated by bats. Thus, these necessary prerequisites for a communication via prosodic structures in mammals have evolved far before human speech.
Acquisition of stress and pitch accent in English-Spanish bilingual children
NASA Astrophysics Data System (ADS)
Kim, Sahyang; Andruski, Jean; Nathan, Geoffrey S.; Casielles, Eugenia; Work, Richard
2005-09-01
Although understanding of prosodic development is considered crucial for understanding of language acquisition in general, few studies have focused on how children develop native-like prosody in their speech production. This study will examine the acquisition of lexical stress and postlexical pitch accent in two English-Spanish bilingual children. Prosodic characteristics of English and Spanish are different in terms of frequent stress patterns (trochaic versus penultimate), phonetic realization of stress (reduced unstressed vowel versus full unstressed vowel), and frequent pitch accent types (H* versus L*+H), among others. Thus, English-Spanish bilingual children's prosodic development may provide evidence of their awareness of language differences relatively early during language development, and illustrate the influence of markedness or input frequency in prosodic acquisition. For this study, recordings from the children's one-word stage are used. Durations of stressed and unstressed syllables and F0 peak alignment are measured, and pitch accent types in different accentual positions (nuclear versus prenuclear) are transcribed using American English ToBI and Spanish ToBI. Prosodic development is compared across ages within each language and across languages at each age. Furthermore, the bilingual children's productions are compared with monolingual English and Spanish parents' productions.
Prosodic Boundaries in Writing: Evidence from a Keystroke Analysis
Fuchs, Susanne; Krivokapić, Jelena
2016-01-01
The aim of the paper is to investigate duration between successive keystrokes during typing in order to examine whether prosodic boundaries are expressed in the process of writing. In particular, we are interested in interkey durations that occur next to punctuation marks (comma and full stops while taking keystrokes between words as a reference), since these punctuation marks are often realized with minor or major prosodic boundaries during overt reading. A two-part experiment was conducted: first, participants’ keystrokes on a computer keyboard were recorded while writing an email to a close friend (in two conditions: with and without time pressure). Second, participants read the email they just wrote. Interkey durations were compared to pause durations at the same locations during read speech. Results provide evidence of significant differences between interkey durations between words, at commas and at full stops (from shortest to longest). These durations were positively correlated with silent pause durations during overt reading. A more detailed analysis of interkey durations revealed patterns that can be interpreted with respect to prosodic boundaries in speech production, namely as phrase-final and phrase-initial lengthening occurring at punctuation marks. This work provides initial evidence that prosodic boundaries are reflected in the writing process. PMID:27917129
Prosody and informativity: A cross-linguistic investigation
NASA Astrophysics Data System (ADS)
Ouyang, Iris Chuoying
This dissertation aims to extend our knowledge of prosody -- in particular, what kinds of information may be conveyed through prosody, which prosodic dimensions may be used to convey them, and how individual speakers differ from one another in how they use prosody. Four production studies were conducted to examine how various factors interact with one another in shaping the prosody of an utterance and how prosody fulfills its multi-functional role. Experiment 1 explores the interaction between two types of informativity, namely information structure and information-theoretic properties. The results show that the prosodic consequences of new-information focus are modulated by the focused word's frequency, whereas the prosodic consequences of corrective focus are modulated by the focused word's probability in the context. Furthermore, f0 ranges appear to be more informative than f0 shapes in reflecting informativity across speakers. Specifically, speakers seem to have individual 'preferences' regarding f0 shapes, the f0 ranges they use for an utterance, and the magnitude of differences in f0 ranges by which they mark information-structural distinctions. In contrast, there is more cross-speaker validity in the actual directions of differences in f0 ranges between information-structural types. Experiments 2 and 3 further show that the interaction found between corrective focus and contextual probability depends on the interlocutor's knowledge state. When the interlocutor has no access to the crucial information concerning utterances' contextual probability, speakers prosodically emphasize contextually improbable corrections, but not contextually probable corrections. Furthermore, speakers prosodically emphasize the corrections in response to contextually probable misstatements, but not the corrections in response to contextually improbable misstatements. In contrast, completely opposite patterns are found when words' contextual probability is shared knowledge between the speaker and the interlocutor: speakers prosodically emphasize contextually probable corrections and the corrections in response to contextually improbable misstatements. Experiment 4 demonstrates the multi-functionality of prosody by investigating its discourse-level functions in Mandarin Chinese, a tone language where a word's prosodic pattern is crucial to its meaning. The results show that, although prosody serves fundamental, lexical-level functions in Mandarin Chinese, it nevertheless provides cues to information structure as well. Similar to what has been found with English, corrective information is prosodically more prominent than non-corrective information, and new information is prosodically more prominent than given information. Taken together, these experiments demonstrate the complex relationship between prosody and the different types of information it encodes in a given language. To better understand prosody, it is important to integrate insights from different traditions of research and to investigate across languages. In addition, the findings of this research suggest that speakers' assumptions about what their interlocutors know -- as well as speakers' ability to update these expectations -- play a key role in shaping the prosody of utterances. I hypothesize that prosodic prominence may reflect the gap between what speakers had expected their interlocutors to say and what their interlocutors have actually said.
Prosodic differences between declaratives and interrogatives in infant-directed speech.
Geffen, Susan; Mintz, Toben H
2017-07-01
In many languages, declaratives and interrogatives differ in word order properties, and in syntactic organization more broadly. Thus, in order to learn the distinct syntactic properties of the two sentence types, learners must first be able to distinguish them using non-syntactic information. Prosodic information is often assumed to be a useful basis for this type of discrimination, although no systematic studies of the prosodic cues available to infants have been reported. Analysis of maternal speech in three Standard American English-speaking mother-infant dyads found that polar interrogatives differed from declaratives on the patterning of pitch and duration on the final two syllables, but wh-questions did not. Thus, while prosody is unlikely to aid discrimination of declaratives from wh-questions, infant-directed speech provides prosodic information that infants could use to distinguish declaratives and polar interrogatives. We discuss how learners could leverage this information to identify all question forms, in the context of syntax acquisition.
Atypical prosody in Asperger syndrome: perceptual and acoustic measurements.
Filipe, Marisa G; Frota, Sónia; Castro, São Luís; Vicente, Selene G
2014-08-01
It is known that individuals with Asperger syndrome (AS) may show no problems with regard to what is said (e.g., lexical content) but tend to have difficulties in how utterances are produced, i.e., they may show prosodic impairments. In the present study, we focus on the use of prosodic features to express grammatical meaning. Specifically, we explored the sentence type difference between statements and questions that is conveyed by intonation, using perceptual and acoustic measurements. Children aged 8 and 9 years with AS (n = 12) were matched according to age and nonverbal intelligence with typically developing peers (n = 17). Although children with AS could produce categorically accurate prosodic patterns, their prosodic contours were perceived as odd by adult listeners, and acoustic measurements showed alterations in duration and pitch. Additionally, children with AS had greater variability in fundamental frequency contours compared to typically developing peers.
Prosody and parsing in coordination structures.
Schepman, A; Rodway, P
2000-05-01
The effect of prosodic boundary cues on the off-line disambiguation and on-line parsing of coordination structures was examined. It was found that relative clauses were attached to coordinated object noun phrases in preference to second conjuncts in sentences like: The lawyer greeted the powerful barrister and the wise judge who was/were walking to the courtroom. Naive speakers signalled the syntactic contrast between the two structures by a prosodic break between the conjuncts when the relative clause was attached to the second conjunct. Listeners were able to use this prosodic information in both off-line syntactic disambiguation and on-line syntactic parsing. The findings are compatible with a model in which prosody has a strong immediate effect on parsing. It is argued that the current experimental design has avoided confounds present in earlier studies on the on-line integration of prosodic and syntactic information.
Further characterisation of the functional neuroanatomy associated with prosodic emotion decoding.
Mitchell, Rachel L C
2013-06-01
Current models of prosodic emotion comprehension propose a three stage cognition mediated by temporal lobe auditory regions through to inferior and orbitofrontal regions. Cumulative evidence suggests that its mediation may be more flexible though, with a facility to respond in a graded manner based on the need for executive control. The location of this fine-tuning system is unclear, as is its similarity to the cognitive control system. In the current study, need for executive control was manipulated in a block-design functional MRI study by systematically altering the proportion of incongruent trials across time, i.e., trials for which participants identified prosodic emotions in the face of conflicting lexico-semantic emotion cues. Resultant Blood Oxygenation Level Dependent contrast data were analysed according to standard procedures using Statistical Parametric Mapping v8 (Ashburner et al., 2009). In the parametric analyses, superior (medial) frontal gyrus activity increased linearly with increased need for executive control. In the separate analyses of each level of incongruity, results suggested that the baseline prosodic emotion comprehension system was sufficient to deal with low proportions of incongruent trials, whereas a more widespread frontal lobe network was required for higher proportions. These results suggest an executive control system for prosodic emotion comprehension exists which has the capability to recruit superior (medial) frontal gyrus in a graded manner and other frontal regions once demand exceeds a certain threshold. The need to revise current models of prosodic emotion comprehension and add a fourth processing stage is discussed. Copyright © 2012 Elsevier Ltd. All rights reserved.
Contrast-marking prosodic emphasis in Williams syndrome: results of detailed phonetic analysis.
Ito, Kiwako; Martens, Marilee A
2017-01-01
Past reports on the speech production of individuals with Williams syndrome (WS) suggest that their prosody is anomalous and may lead to challenges in spoken communication. While existing prosodic assessments confirm that individuals with WS fail to use prosodic emphasis to express contrast, those reports typically lack detailed phonetic analysis of speech data. The present study examines the acoustic properties of speech prosody, aiming for the future development of targeted speech interventions. The study examines the three primary acoustic correlates of prosodic emphasis (duration, intensity, F0) and determines whether individuals with WS have difficulty in producing all or a particular set of the three prosodic cues. Speech produced by 12 individuals with WS and 12 chronological age (CA)-matched typically developing individuals was recorded. A sequential picture-naming task elicited production of target phrases in three contexts: (1) no contrast: gorilla with a racket → rabbit with a balloon; (2) contrast on the animal: fox with a balloon → rabbit with a balloon; and (3) contrast on the object: rabbit with a ball → rabbit with a balloon. The three acoustic correlates of prosodic prominence (duration, intensity and F0) were compared across the three referential contexts. The two groups exhibited striking similarities in their use of word duration and intensity for expressing contrast. Both groups showed the reduction and enhancement of final lengthening, and the enhancement and reduction of intensity difference for the animal contrast and for the object contrast conditions, respectively. The two groups differed in their use of F0: the CA group produced higher F0 for the animal than for the object regardless of the context, and this difference was enhanced when the animal noun was contrastive. In contrast, the WS group produced higher F0 for the object than for the animal when the object was contrastive. The present data contradict previous assessment results that report a lack of prosodic skills to mark contrast in individuals with WS. The methodological differences that may account for this variability are discussed. The present data suggest that individuals with WS produce appropriate prosodic cues to express contrast, although their use of pitch may be somewhat atypical. Additional data and future speech comprehension studies will determine whether pitch modulation can be targeted for speech intervention in individuals with WS. © 2016 Royal College of Speech and Language Therapists.
Prosodic Contrasts in Ironic Speech
ERIC Educational Resources Information Center
Bryant, Gregory A.
2010-01-01
Prosodic features in spontaneous speech help disambiguate implied meaning not explicit in linguistic surface structure, but little research has examined how these signals manifest themselves in real conversations. Spontaneously produced verbal irony utterances generated between familiar speakers in conversational dyads were acoustically analyzed…
Punctuation, Prosody, and Discourse: Afterthought Vs. Right Dislocation
Kalbertodt, Janina; Primus, Beatrice; Schumacher, Petra B.
2015-01-01
In a reading production experiment we investigate the impact of punctuation and discourse structure on the prosodic differentiation of right dislocation (RD) and afterthought (AT). Both discourse structure and punctuation are likely to affect the prosodic marking of these right-peripheral constructions, as certain prosodic markings are appropriate only in certain discourse structures, and punctuation is said to correlate with prosodic phrasing. With RD and AT clearly differing in discourse function (comment-topic structuring vs. disambiguation) and punctuation (comma vs. full stop), critical items in this study were manipulated with regard to the (mis-)match of these parameters. Since RD and AT are said to prosodically differ in pitch range, phrasing, and accentuation patterns, we measured the reduction of pitch range, boundary strength and prominence level. Results show an effect of both punctuation and discourse context (mediated by syntax) on phrasing and accentuation. Interestingly, for pitch range reduction no difference between RDs and ATs could be observed. Our results corroborate a language architecture model in which punctuation, prosody, syntax, and discourse-semantics are independent but interacting domains with correspondence constraints between them. Our findings suggest there are tight correspondence constraints between (i) punctuation (full stop and comma in particular) and syntax, (ii) prosody and syntax as well as (iii) prosody and discourse-semantics. PMID:26648883
Prosodic structure shapes the temporal realization of intonation and manual gesture movements.
Esteve-Gibert, Núria; Prieto, Pilar
2013-06-01
Previous work on the temporal coordination between gesture and speech found that the prominence in gesture coordinates with speech prominence. In this study, the authors investigated the anchoring regions in speech and pointing gesture that align with each other. The authors hypothesized that (a) in contrastive focus conditions, the gesture apex is anchored in the intonation peak and (b) the upcoming prosodic boundary influences the timing of gesture and intonation movements. Fifteen Catalan speakers pointed at a screen while pronouncing a target word with different metrical patterns in a contrastive focus condition and followed by a phrase boundary. A total of 702 co-speech deictic gestures were acoustically and gesturally analyzed. Intonation peaks and gesture apexes showed parallel behavior with respect to their position within the accented syllable: They occurred at the end of the accented syllable in non-phrase-final position, whereas they occurred well before the end of the accented syllable in phrase-final position. Crucially, the position of intonation peaks and gesture apexes was correlated and was bound by prosodic structure. The results refine the phonological synchronization rule (McNeill, 1992), showing that gesture apexes are anchored in intonation peaks and that gesture and prosodic movements are bound by prosodic phrasing.
Sensitivity to visual prosodic cues in signers and nonsigners.
Brentari, Diane; González, Carolina; Seidl, Amanda; Wilbur, Ronnie
2011-03-01
Three studies are presented in this paper that address how nonsigners perceive the visual prosodic cues in a sign language. In Study 1, adult American nonsigners and users of American Sign Language (ASL) were compared on their sensitivity to the visual cues in ASL Intonational Phrases. In Study 2, hearing, nonsigning American infants were tested using the same stimuli used in Study 1 to see whether maturity, exposure to gesture, or exposure to sign language is necessary to demonstrate this type of sensitivity. Study 3 addresses nonsigners' and signers' strategies for segmenting Prosodic Words in a sign language. Adult participants from six language groups (3 spoken languages and 3 sign languages) were tested. The results of these three studies indicate that nonsigners have a high degree of sensitivity to sign language prosodic cues at the Intonational Phrase level and the Prosodic Word level; these are attributed to modality or 'channel' effects of the visual signal. There are also some differences between signers' and nonsigners' sensitivity; these differences are attributed to language experience or language-particular constraints. This work is useful in understanding the gestural competence of nonsigners and the ways in which this type of competence may contribute to the grammaticalization of these properties in a sign language.
Sequential and prosodic design of English and Greek non-valenced news receipts.
Kaimaki, Marianna
2012-03-01
Results arising from a prosodic and interactional study of the organization of everyday talk in English suggest that news receipts can be grouped into two categories: valenced (e.g., oh good) and non-valenced (e.g., oh really). In-depth investigation of both valenced and non-valenced news receipts shows that differences in their prosodic design do not seem to affect the sequential structure of the news informing sequence. News receipts with falling and rising pitch may have the same uptake and are treated in the same way by co-participants. A preliminary study of a Greek telephone corpus yielded the following receipts of news announcements: a malista, a(h) orea, a ne, a, oh. These are news markers composed of a standalone particle or a particle followed by an adverb or a response token (ne). Analysis of the sequential and prosodic design of Greek news announcement sequences is made to determine any interactional patterns and/or prosodic constraints. By examining the way in which co-participants display their interpretation of these turns I show that the phonological systems of contrast are different depending on the sequential environment, in much the same way that consonantal systems of contrast are not the same syllable initially and finally.
Dilley, Laura C; Wieland, Elizabeth A; Gamache, Jessica L; McAuley, J Devin; Redford, Melissa A
2013-02-01
As children mature, changes in voice spectral characteristics co-vary with changes in speech, language, and behavior. In this study, spectral characteristics were manipulated to alter the perceived ages of talkers' voices while leaving critical acoustic-prosodic correlates intact, to determine whether perceived age differences were associated with differences in judgments of prosodic, segmental, and talker attributes. Speech was modified by lowering formants and fundamental frequency, for 5-year-old children's utterances, or raising them, for adult caregivers' utterances. Next, participants differing in awareness of the manipulation (Experiment 1A) or amount of speech-language training (Experiment 1B) made judgments of prosodic, segmental, and talker attributes. Experiment 2 investigated the effects of spectral modification on intelligibility. Finally, in Experiment 3, trained analysts used formal prosody coding to assess prosodic characteristics of spectrally modified and unmodified speech. Differences in perceived age were associated with differences in ratings of speech rate, fluency, intelligibility, likeability, anxiety, cognitive impairment, and speech-language disorder/delay; effects of training and awareness of the manipulation on ratings were limited. There were no significant effects of the manipulation on intelligibility or formally coded prosody judgments. Age-related voice characteristics can greatly affect judgments of speech and talker characteristics, raising cautionary notes for developmental research and clinical work.
Perceptual effects of dialectal and prosodic variation in vowels
NASA Astrophysics Data System (ADS)
Fox, Robert Allen; Jacewicz, Ewa; Hatcher, Kristin; Salmons, Joseph
2005-09-01
As was reported earlier [Fox et al., J. Acoust. Soc. Am. 114, 2396 (2003)], certain vowels in the Ohio and Wisconsin dialects of American English are shifting in different directions. In addition, we have found that the spectral characteristics of these vowels (e.g., duration and formant frequencies) changed systematically under varying degrees of prosodic prominence, with somewhat different changes occurring within each dialect. The question addressed in the current study is whether naive listeners from these two dialects are sensitive to both the dialect variations and to the prosodically induced spectral differences. Listeners from Ohio and Wisconsin listened to the stimulus tokens [beIt] and [bɛt] produced in each of three prosodic contexts (representing three different levels of prominence). These words were produced by speakers from Ohio or from Wisconsin (none of the listeners were also speakers). Listeners identified the stimulus tokens in terms of vowel quality and indicated whether it was a good, fair, or poor exemplar of that phonetic category. Results showed that both phonetic quality decisions and goodness ratings were systematically and significantly affected by speaker dialect, listener dialect, and prosodic context. Implications of source and nature of ongoing vowel changes in these two dialects will be discussed. [Work partially supported by NIDCD R03 DC005560-01.]
Domahs, Ulrike; Klein, Elise; Huber, Walter; Domahs, Frank
2013-06-01
Using a stress violation paradigm, we investigated whether metrical feet constrain the way prosodic patterns are processed and evaluated. Processing of correctly versus incorrectly stressed words was associated with activation in left posterior angular and retrosplenial cortex, indicating the recognition of an expected and familiar pattern, whereas the inverse contrast yielded enhanced bilateral activation in the superior temporal gyrus, reflecting higher costs in auditory (re-)analysis. More fine-grained analyses of severe versus mild stress violations revealed activations of the left superior temporal and left anterior angular gyrus whereas the opposite contrast led to frontal activations including Broca's area and its right-hemisphere homologue, suggesting that detection of mild violations leads to increased effort in working memory and deeper phonological processing. Our results provide first evidence that different incorrect stress patterns are processed in a qualitatively different way and that the underlying foot structure seems to determine potential stress positions in German words. Copyright © 2013 Elsevier Inc. All rights reserved.
The Role of Prosodic Sensitivity in Children's Reading Development
ERIC Educational Resources Information Center
Whalley, Karen; Hansen, Julie
2006-01-01
While the critical importance of phonological awareness (segmental phonology) to reading ability is well established, the potential role of prosody (suprasegmental phonology) in reading development has only recently been explored. This study examined the relationship between children's prosodic skills and reading ability. Hierarchical multiple…
Prosodic Phonological Representations Early in Visual Word Recognition
ERIC Educational Resources Information Center
Ashby, Jane; Martin, Andrea E.
2008-01-01
Two experiments examined the nature of the phonological representations used during visual word recognition. We tested whether a minimality constraint (R. Frost, 1998) limits the complexity of early representations to a simple string of phonemes. Alternatively, readers might activate elaborated representations that include prosodic syllable…
Interactivity in Prosodic Representations in Children
ERIC Educational Resources Information Center
Goffman, Lisa; Westover, Stefanie
2013-01-01
The aim of this study was to determine, using speech error and articulatory analyses, whether the binary distinction between iambs and trochees should be extended to include additional prosodic subcategories. Adults, children who are normally developing, and children with specific language impairment (SLI) participated. Children with SLI were…
Children's Use of the Prosodic Characteristics of Infant-Directed Speech.
ERIC Educational Resources Information Center
Weppelman, Tammy L.; Bostow, Angela; Schiffer, Ryan; Elbert-Perez, Evelyn; Newman, Rochelle S.
2003-01-01
Examined whether young children (4 years of age) show prosodic changes when speaking to infants. Measured children's word duration in infant-directed speech compared to adult-directed speech, examined amplitude variability, and examined both average fundamental frequency and fundamental frequency standard deviation. Results indicate that…
Dilley, Laura C.; Wieland, Elizabeth A.; Gamache, Jessica L.; McAuley, J. Devin; Redford, Melissa A.
2013-01-01
Purpose: As children mature, changes in voice spectral characteristics covary with changes in speech, language, and behavior. Spectral characteristics were manipulated to alter the perceived ages of talkers’ voices while leaving critical acoustic-prosodic correlates intact, to determine whether perceived age differences were associated with differences in judgments of prosodic, segmental, and talker attributes. Method: Speech was modified by lowering formants and fundamental frequency, for 5-year-old children’s utterances, or raising them, for adult caregivers’ utterances. Next, participants differing in awareness of the manipulation (Exp. 1a) or amount of speech-language training (Exp. 1b) made judgments of prosodic, segmental, and talker attributes. Exp. 2 investigated the effects of spectral modification on intelligibility. Finally, in Exp. 3 trained analysts used formal prosody coding to assess prosodic characteristics of spectrally-modified and unmodified speech. Results: Differences in perceived age were associated with differences in ratings of speech rate, fluency, intelligibility, likeability, anxiety, cognitive impairment, and speech-language disorder/delay; effects of training and awareness of the manipulation on ratings were limited. There were no significant effects of the manipulation on intelligibility or formally coded prosody judgments. Conclusions: Age-related voice characteristics can greatly affect judgments of speech and talker characteristics, raising cautionary notes for developmental research and clinical work. PMID:23275414
ERIC Educational Resources Information Center
Orestrom, Bengt
A study analyzed four dyadic conversations for evidence of the signals operating in the turn-taking process and facilitating the smooth exchange of turns. It found over 20 syntactic, prosodic, and semantic features occurring frequently with turn-taking. The five most significant factors correlating with turn-taking were a prosodically completed…
Prosodic Features and Speech Naturalness in Individuals with Dysarthria
ERIC Educational Resources Information Center
Klopfenstein, Marie I.
2012-01-01
Despite the importance of speech naturalness to treatment outcomes, little research has been done on what constitutes speech naturalness and how to best maximize naturalness in relationship to other treatment goals like intelligibility. In addition, previous literature alludes to the relationship between prosodic aspects of speech and speech…
Prosodic Adaptations to Pitch Perturbation in Running Speech
ERIC Educational Resources Information Center
Patel, Rupal; Niziolek, Caroline; Reilly, Kevin; Guenther, Frank H.
2011-01-01
Purpose: A feedback perturbation paradigm was used to investigate whether prosodic cues are controlled independently or in an integrated fashion during sentence production. Method: Twenty-one healthy speakers of American English were asked to produce sentences with emphatic stress while receiving real-time auditory feedback of their productions.…
Pauses and Intonational Phrasing: ERP Studies in 5-Month-Old German Infants and Adults
ERIC Educational Resources Information Center
Mannel, Claudia; Friederici, Angela D.
2009-01-01
In language learning, infants are faced with the challenge of decomposing continuous speech into relevant units, such as syntactic clauses and words. Within the framework of prosodic bootstrapping, behavioral studies suggest infants approach this segmentation problem by relying on prosodic information, especially on acoustically marked…
Prosodic Awareness and Punctuation Ability in Adult Readers
ERIC Educational Resources Information Center
Heggie, Lindsay; Wade-Woolley, Lesly
2018-01-01
We examined the relationship between two metalinguistic tasks: prosodic awareness and punctuation ability. Specifically, we investigated whether adults' ability to punctuate was related to the degree to which they are aware of and able to manipulate prosody in spoken language. English-speaking adult readers (n = 115) were administered a receptive…
Truncation Without Shape Constraints: The Latter Stages of Prosodic Acquisition.
ERIC Educational Resources Information Center
Kehoe, Margaret M.
2000-01-01
Evaluates the claim of uniform size and shape restrictions in prosodic development using a cross-sectional database of English-speaking children's multisyllabic word productions. Suggests children's increasing faithfulness to unstressed syllables can be explained by different constraint rankings that relate to edge alignment, syllable structure,…
Prosodic Awareness and Children's Multisyllabic Word Reading
ERIC Educational Resources Information Center
Holliman, Andrew J.; Mundy, Ian R.; Wade-Woolley, Lesly; Wood, Clare; Bird, Chelsea
2017-01-01
Prosodic awareness (the rhythmic patterning of speech) accounts for unique variance in reading development. However, studies have thus far focused on early readers and utilised literacy measures which fail to distinguish between monosyllabic and multisyllabic words. The current study investigated the factors that are specifically associated with…
Prosodic Skills in Children with Down Syndrome and in Typically Developing Children
ERIC Educational Resources Information Center
Zampini, Laura; Fasolo, Mirco; Spinelli, Maria; Zanchi, Paola; Suttora, Chiara; Salerni, Nicoletta
2016-01-01
Background: Many studies have analysed language development in children with Down syndrome to understand better the nature of their linguistic delays and the reason why these delays, particularly those in the morphosyntactic area, seem greater than their cognitive impairment. However, the prosodic characteristics of language development in…
A Closer Look at Formulaic Language: Prosodic Characteristics of Swedish Proverbs
ERIC Educational Resources Information Center
Hallin, Anna Eva; Van Lancker Sidtis, Diana
2017-01-01
Formulaic expressions (such as idioms, proverbs, and conversational speech formulas) are currently a topic of interest. Examination of prosody in formulaic utterances, a less explored property of formulaic expressions, has yielded controversial views. The present study investigates prosodic characteristics of proverbs, as one type of formulaic…
Prosodic Encoding in Silent Reading.
ERIC Educational Resources Information Center
Wilkenfeld, Deborah
In silent reading, short-memory tasks, such as semantic and syntactic processing, require a stage of phonetic encoding between visual representation and the actual extraction of meaning, and this encoding includes prosodic as well as segmental features. To test for this suprasegmental coding, an experiment was conducted in which subjects were…
Prosodic Disambiguation in Child-Directed Speech
ERIC Educational Resources Information Center
Kempe, Vera; Schaeffler, Sonja; Thoresen, John C.
2010-01-01
The study examines whether speakers exaggerate prosodic cues to syntactic structure when addressing young children. In four experiments, 72 mothers and 48 non-mothers addressed either real 2-4-year-old or imaginary children as well as adult confederates using syntactically ambiguous sentences like "Touch the cat with the spoon" intending to convey…
Prosodic Disambiguation of Syntactic Structure: For the Speaker or for the Addressee?
ERIC Educational Resources Information Center
Kraljic, Tanya; Brennan, Susan E.
2005-01-01
Evidence has been mixed on whether speakers spontaneously and reliably produce prosodic cues that resolve syntactic ambiguities. And when speakers do produce such cues, it is unclear whether they do so "for" their addressees (the "audience design" hypothesis) or "for" themselves, as a by-product of planning and articulating utterances. Three…
Infant-Mother Acoustic-Prosodic Alignment and Developmental Risk
ERIC Educational Resources Information Center
Seidl, Amanda; Cristia, Alejandrina; Soderstrom, Melanie; Ko, Eon-Suk; Abel, Emily A.; Kellerman, Ashleigh; Schwichtenberg, A. J.
2018-01-01
Purpose: One promising early marker for autism and other communicative and language disorders is early infant speech production. Here we used daylong recordings of high- and low-risk infant-mother dyads to examine whether acoustic-prosodic alignment as well as two automated measures of infant vocalization are related to developmental risk status…
Prosodic Differences between Declaratives and Interrogatives in Infant-Directed Speech
ERIC Educational Resources Information Center
Geffen, Susan; Mintz, Toben H.
2017-01-01
In many languages, declaratives and interrogatives differ in word order properties, and in syntactic organization more broadly. Thus, in order to learn the distinct syntactic properties of the two sentence types, learners must first be able to distinguish them using non-syntactic information. Prosodic information is often assumed to be a useful…
The Relationship between Form and Function Level Receptive Prosodic Abilities in Autism
ERIC Educational Resources Information Center
Jarvinen-Pasley, Anna; Peppe, Susan; King-Smith, Gavin; Heaton, Pamela
2008-01-01
Prosody can be conceived as having form (auditory-perceptual characteristics) and function (pragmatic/linguistic meaning). No known studies have examined the relationship between form- and function-level prosodic skills in relation to the effects of stimulus length and/or complexity upon such abilities in autism. Research in this area is both…
ERIC Educational Resources Information Center
Hoole, Philip; Bombien, Lasse
2017-01-01
Purpose: The purpose of this study is to use prosodic and syllable-structure variation to probe the underlying representation of laryngeal kinematics in languages traditionally considered to differ in voicing typology (German vs. Dutch and French). Method: Transillumination and videofiberendoscopic filming were used to investigate the devoicing…
Impaired Perception of Syllable Stress in Children with Dyslexia: A Longitudinal Study
ERIC Educational Resources Information Center
Goswami, Usha; Mead, Natasha; Fosker, Tim; Huss, Martina; Barnes, Lisa; Leong, Victoria
2013-01-01
Prosodic patterning is a key structural element of spoken language. However, the potential role of prosodic awareness in the phonological difficulties that characterise children with developmental dyslexia has been little studied. Here we report the first longitudinal study of sensitivity to syllable stress in children with dyslexia, enabling the…
Is Prosodic Production Driven by Lexical Development? Longitudinal Evidence from Babble and Words
ERIC Educational Resources Information Center
De Clerck, Ilke; Pettinato, Michele; Verhoeven, Jo; Gillis, Steven
2017-01-01
This study investigated the relation between lexical development and the production of prosodic prominence in disyllabic babble and words. Monthly recordings from nine typically developing Belgian-Dutch-speaking infants were analyzed from the onset of babbling until a cumulative vocabulary of 200 words was reached. The differentiation between the…
Influences of Semantic and Prosodic Cues on Word Repetition and Categorization in Autism
ERIC Educational Resources Information Center
Singh, Leher; Harrow, MariLouise S.
2014-01-01
Purpose: To investigate sensitivity to prosodic and semantic cues to emotion in individuals with high-functioning autism (HFA). Method: Emotional prosody and semantics were independently manipulated to assess the relative influence of prosody versus semantics on speech processing. A sample of 10-year-old typically developing children (n = 10) and…
ERIC Educational Resources Information Center
Criado de Val, Manuel
1975-01-01
Discusses the esthetic ideas of Henri Bergson with particular emphasis on his thoughts about "intuition" in art and the importance of the prosodic features of literary style. The prosodic quality parallels or reproduces the rhythm of the thoughts being expressed. (Text is in Spanish.) (TL)
The semantics of prosody: acoustic and perceptual evidence of prosodic correlates to word meaning.
Nygaard, Lynne C; Herold, Debora S; Namy, Laura L
2009-01-01
This investigation examined whether speakers produce reliable prosodic correlates to meaning across semantic domains and whether listeners use these cues to derive word meaning from novel words. Speakers were asked to produce phrases in infant-directed speech in which novel words were used to convey one of two meanings from a set of antonym pairs (e.g., big/small). Acoustic analyses revealed that some acoustic features were correlated with overall valence of the meaning. However, each word meaning also displayed a unique acoustic signature, and semantically related meanings elicited similar acoustic profiles. In two perceptual tests, listeners either attempted to identify the novel words with a matching meaning dimension (picture pair) or with mismatched meaning dimensions. Listeners inferred the meaning of the novel words significantly more often when prosody matched the word meaning choices than when prosody mismatched. These findings suggest that speech contains reliable prosodic markers to word meaning and that listeners use these prosodic cues to differentiate meanings. That prosody is semantic suggests a reconceptualization of traditional distinctions between linguistic and nonlinguistic properties of spoken language. Copyright © 2009 Cognitive Science Society, Inc.
Li, Jackie P W; Law, Thomas; Lam, Gary Y H; To, Carol K S
2013-01-01
English-speaking children with Autism Spectrum Disorders (ASD) are less capable of using prosodic cues such as intonation for irony comprehension. Prosodic cues, in particular intonation, in Cantonese are relatively restricted while sentence-final particles (SFPs) may be used for this pragmatic function. This study investigated the use of prosodic cues and SFPs in irony comprehension in Cantonese-speaking children with and without ASD. Thirteen children with ASD (8;3-12;9) were language-matched with 13 typically developing (TD) peers. By manipulating prosodic cues and SFPs, 16 stories with an ironic remark were constructed. Participants had to judge the speaker's belief and intention. Both groups performed similarly well in judging the speaker's belief. For the speaker's intention, the TD group relied more on SFPs. The ASD group performed significantly poorer and did not rely on either cue. SFPs may play a salient role in Cantonese irony comprehension. The differences between the two groups were discussed by considering the literature on theory of mind.
Borrie, Stephanie A.; Lubold, Nichola; Pon-Barry, Heather
2015-01-01
Conversational entrainment, a pervasive communication phenomenon in which dialogue partners adapt their behaviors to align more closely with one another, is considered essential for successful spoken interaction. While well-established in other disciplines, this phenomenon has received limited attention in the field of speech pathology and the study of communication breakdowns in clinical populations. The current study examined acoustic-prosodic entrainment, as well as a measure of communicative success, in three distinctly different dialogue groups: (i) healthy native vs. healthy native speakers (Control), (ii) healthy native vs. foreign-accented speakers (Accented), and (iii) healthy native vs. dysarthric speakers (Disordered). Dialogue group comparisons revealed significant differences in how the groups entrain on particular acoustic–prosodic features, including pitch, intensity, and jitter. Most notably, the Disordered dialogues were characterized by significantly less acoustic-prosodic entrainment than the Control dialogues. Further, a positive relationship between entrainment indices and communicative success was identified. These results suggest that the study of conversational entrainment in speech pathology will have essential implications for both scientific theory and clinical application in this domain. PMID:26321996
Bone, Daniel; Lee, Chi-Chun; Black, Matthew P.; Williams, Marian E.; Lee, Sungbok; Levitt, Pat; Narayanan, Shrikanth
2015-01-01
Purpose: The purpose of this study was to examine relationships between prosodic speech cues and autism spectrum disorder (ASD) severity, hypothesizing a mutually interactive relationship between the speech characteristics of the psychologist and the child. The authors objectively quantified acoustic-prosodic cues of the psychologist and of the child with ASD during spontaneous interaction, establishing a methodology for future large-sample analysis. Method: Speech acoustic-prosodic features were semiautomatically derived from segments of semistructured interviews (Autism Diagnostic Observation Schedule, ADOS; Lord, Rutter, DiLavore, & Risi, 1999; Lord et al., 2012) with 28 children who had previously been diagnosed with ASD. Prosody was quantified in terms of intonation, volume, rate, and voice quality. Research hypotheses were tested via correlation as well as hierarchical and predictive regression between ADOS severity and prosodic cues. Results: Automatically extracted speech features demonstrated prosodic characteristics of dyadic interactions. As rated ASD severity increased, both the psychologist and the child demonstrated effects for turn-end pitch slope, and both spoke with atypical voice quality. The psychologist’s acoustic cues predicted the child’s symptom severity better than did the child’s acoustic cues. Conclusion: The psychologist, acting as evaluator and interlocutor, was shown to adjust his or her behavior in predictable ways based on the child’s social-communicative impairments. The results support future study of speech prosody of both interaction partners during spontaneous conversation, while using automatic computational methods that allow for scalable analysis on much larger corpora. PMID:24686340
Phrase boundary effects on the temporal kinematics of sequential tongue tip consonants
Byrd, Dani; Lee, Sungbok; Campos-Astorkiza, Rebeka
2008-01-01
This study evaluates the effects of phrase boundaries on the intra- and intergestural kinematic characteristics of blended gestures, i.e., overlapping gestures produced with a single articulator. The sequences examined are the juncture geminate [d(#)d], the sequence [d(#)z], and, for comparison, the singleton tongue tip gesture in [d(#)b]. This allows the investigation of the process of gestural aggregation [Munhall, K. G., and Löfqvist, A. (1992). “Gestural aggregation in speech: laryngeal gestures,” J. Phonetics 20, 93–110] and the manner in which it is affected by prosodic structure. Juncture geminates are predicted to be affected by prosodic boundaries in the same way as other gestures; that is, they should display prosodic lengthening and lesser overlap across a boundary. Articulatory prosodic lengthening is also investigated using a signal alignment method of the functional data analysis framework [Ramsay, J. O., and Silverman, B. W. (2005). Functional Data Analysis, 2nd ed. (Springer-Verlag, New York)]. This provides the ability to examine a time warping function that characterizes relative timing difference (i.e., lagging or advancing) of a test signal with respect to a given reference, thus offering a way of illuminating local nonlinear deformations at work in prosodic lengthening. These findings are discussed in light of the π-gesture framework of Byrd and Saltzman [(2003) “The elastic phrase: Modeling the dynamics of boundary-adjacent lengthening,” J. Phonetics 31, 149–180]. PMID:18537396
Prosodic skills in children with Down syndrome and in typically developing children.
Zampini, Laura; Fasolo, Mirco; Spinelli, Maria; Zanchi, Paola; Suttora, Chiara; Salerni, Nicoletta
2016-01-01
Many studies have analysed language development in children with Down syndrome to understand better the nature of their linguistic delays and the reason why these delays, particularly those in the morphosyntactic area, seem greater than their cognitive impairment. However, the prosodic characteristics of language development in children with Down syndrome have been scarcely investigated. The aim was to analyse the prosodic skills of children with Down syndrome in the production of multi-word utterances. Data on the prosodic skills of these children were compared with data on typically developing children matched on developmental age and vocabulary size. Between-group differences and the relationships between prosodic and syntactic skills were investigated. The participants were nine children with Down syndrome (who ranged in chronological age from 45 to 63 months and had a mean developmental age of 30 months) and 12 30-month-old typically developing children. The children in both groups had a vocabulary size of approximately 450 words. The children's spontaneous productions were recorded during observations of mother-child play sessions. Data analyses showed that despite their morphosyntactic difficulties, children with Down syndrome were able to master some aspects of prosody in multi-word utterances. They were able to produce single intonation multi-word utterances on the same level as typically developing children. In addition, the intonation contour of their utterances was not negatively influenced by syntactic complexity, contrary to what occurred in typically developing children, although it has to be considered that the utterances produced by children with Down syndrome were less complex than those produced by children in the control group. However, children with Down syndrome appeared to be less able than typically developing children to use intonation to express the pragmatic interrogative function. The findings are discussed considering the effects of social experience on the utterance prosodic realization. © 2015 Royal College of Speech and Language Therapists.
Atypical Prosody in Asperger Syndrome: Perceptual and Acoustic Measurements
ERIC Educational Resources Information Center
Filipe, Marisa G.; Frota, Sónia; Castro, São Luís; Vicente, Selene G.
2014-01-01
It is known that individuals with Asperger syndrome (AS) may show no problems with regard to what is said (e.g., lexical content) but tend to have difficulties in how utterances are produced, i.e., they may show prosodic impairments. In the present study, we focus on the use of prosodic features to express grammatical meaning. Specifically, we…
ERIC Educational Resources Information Center
Bone, Daniel; Lee, Chi-Chun; Black, Matthew P.; Williams, Marian E.; Lee, Sungbok; Levitt, Pat; Narayanan, Shrikanth
2014-01-01
Purpose: The purpose of this study was to examine relationships between prosodic speech cues and autism spectrum disorder (ASD) severity, hypothesizing a mutually interactive relationship between the speech characteristics of the psychologist and the child. The authors objectively quantified acoustic-prosodic cues of the psychologist and of the…
Effects of Prosodic Cues on Topic Continuity in Child Language Production
ERIC Educational Resources Information Center
Vernice, Mirta; Guasti, Maria Teresa
2014-01-01
It remains controversial whether children are able to process and integrate specific linguistic cues in their mental model to the same extent as adults. In the present study, a sentence continuation task was employed to determine how Italian speakers (4-, 5-, 6-year-olds and adults) interpret prosodic cues to decide which referent is more salient…
Prosodic Marking of Information Structure by Malaysian Speakers of English
ERIC Educational Resources Information Center
Gut, Ulrike; Pillai, Stefanie
2014-01-01
Various researchers have shown that second language (L2) speakers have difficulties with marking information structure in English prosodically: They deviate from native speakers not only in terms of pitch accent placement (Grosser, 1997; Gut, 2009; Ramírez Verdugo, 2002) and the type of pitch accent they produce (Wennerstrom, 1994, 1998) but also…
ERIC Educational Resources Information Center
Gokgoz Kurt, Burcu; Medlin, Julie; Tessarolo, Ashley
2014-01-01
Considering the contradictory research on explicit teaching of suprasegmentals, the present study aims to investigate the effects of explicit instruction on L2 English learners' perception of prosodically ambiguous intonation patterns, as well as the possible effects of reported musical familiarity on intonation acquisition. A control group and a…
ERIC Educational Resources Information Center
Prieto, Pilar
2006-01-01
This paper focuses on the development of Prosodic Word shapes in Catalan, a language which differs from both Spanish and English in the distribution of PW structures. Of particular interest are the truncations of initial unstressed syllables, and how these develop over time. Developmental qualitative and quantitative data from seven…
ERIC Educational Resources Information Center
Koiso, Hanae; Horiuchi, Yasuo; Tutiya, Syun; Ichikawa, Akira; Den, Yasuharu
1998-01-01
Investigates syntactic and prosodic features of speakers' speech at points where turn-taking and backchannels occur, focusing on an analysis of Japanese spontaneous dialogs. The study shows that in both turn-taking and backchannels, some instances of syntactic features make extremely strong contributions, and syntax has a stronger contribution…
Prosodic Awareness Skills and Literacy Acquisition in Spanish
ERIC Educational Resources Information Center
Defior, Sylvia; Gutierrez-Palma, Nicolas; Cano-Marin, Maria Jose
2012-01-01
There has been very little research in Spanish on the potential role of prosodic skills in reading and spelling acquisition, which is the subject of the present study. A total of 85 children in 5th year of Primary Education (mean age 10 years and 9 months) performed tests assessing memory, stress awareness, phonological awareness, reading and…
Sensitivity to Visual Prosodic Cues in Signers and Nonsigners
ERIC Educational Resources Information Center
Brentari, Diane; Gonzalez, Carolina; Seidl, Amanda; Wilbur, Ronnie
2011-01-01
Three studies are presented in this paper that address how nonsigners perceive the visual prosodic cues in a sign language. In Study 1, adult American nonsigners and users of American Sign Language (ASL) were compared on their sensitivity to the visual cues in ASL Intonational Phrases. In Study 2, hearing, nonsigning American infants were tested…
Prosodic and Phonemic Awareness in Children's Reading of Long and Short Words
ERIC Educational Resources Information Center
Wade-Woolley, Lesly
2016-01-01
Phonemic and prosodic awareness are both phonological processes that operate at different levels: the former at the level of the individual sound segment and the latter at the suprasegmental level across syllables. Both have been shown to be related to word reading in young readers. In this study we examine how these processes are differentially…
On the Prosodic Expression of Pragmatic Prominence: The Case of Pitch Register Lowering in Akan
ERIC Educational Resources Information Center
Kugler, Frank; Genzel, Susanne
2012-01-01
This article presents data from three production experiments investigating the prosodic means of encoding information structure in Akan, a tone language that belongs to the Kwa branch of the Niger-Congo family, spoken in Ghana. Information structure was elicited via context questions that put target words either in wide, informational, or…
Utterance-Final Lengthening Is Predictive of Infants' Discrimination of English Accents
ERIC Educational Resources Information Center
White, Laurence; Floccia, Caroline; Goslin, Jeremy; Butler, Joseph
2014-01-01
Infants in their first year manifest selective patterns of discrimination between languages and between accents of the same language. Prosodic differences are held to be important in whether languages can be discriminated, together with the infant's familiarity with one or both of the accents heard. However, the nature of the prosodic cues that…
Prosodic Abilities of Spanish-Speaking Adolescents and Adults with Williams Syndrome
ERIC Educational Resources Information Center
Martinez-Castilla, Pastora; Sotillo, Maria; Campos, Ruth
2011-01-01
In spite of the relevant role of prosody in communication, and in contrast with other linguistic components, there is paucity of research in this field for Williams syndrome (WS). Therefore, this study performed a systematic assessment of prosodic abilities in WS. The Spanish version of the Profiling Elements of Prosody in Speech-Communication…
ERIC Educational Resources Information Center
Patel, Rupal
2003-01-01
Studies of prosodic control in severe dysarthria (DYS) have focused on differences between impaired and nonimpaired speech in terms of the range and variation of fundamental frequency (F0), intensity, and duration. Whether individuals with severe DYS can adequately signal prosodic contrasts and "which" acoustic cues they use to do so has received…
Prosodic Markers of Saliency in Humorous Narratives
ERIC Educational Resources Information Center
Pickering, Lucy; Corduas, Marcella; Eisterhold, Jodi; Seifried, Brenna; Eggleston, Alyson; Attardo, Salvatore
2009-01-01
Much of what we think we know about the performance of humor relies on our intuitions about prosody (e.g., "it's all about timing"); however, this has never been empirically tested. Thus, the central question addressed in this article is whether speakers mark punch lines in jokes prosodically and, if so, how. To answer this question,…
ERIC Educational Resources Information Center
Dilley, Laura C.; Wieland, Elizabeth A.; Gamache, Jessica L.; McAuley, J. Devin; Redford, Melissa A.
2013-01-01
Purpose: As children mature, changes in voice spectral characteristics co-vary with changes in speech, language, and behavior. In this study, spectral characteristics were manipulated to alter the perceived ages of talkers' voices while leaving critical acoustic-prosodic correlates intact, to determine whether perceived age differences were…
ERIC Educational Resources Information Center
Beattie, Rachel L.; Manis, Franklin R.
2014-01-01
Studies have begun to focus on what skills contribute to the development of phonological awareness, an important predictor of reading attainment. One of these skills is the perception of prosody, which is the rhythm, tempo and stress of a language. To examine whether prosodic perception contributes to phonological awareness prior to reading…
Pragmatically Framed Cross-Situational Noun Learning Using Computational Reinforcement Models
Najnin, Shamima; Banerjee, Bonny
2018-01-01
Cross-situational learning and social pragmatic theories are prominent mechanisms for learning word meanings (i.e., word-object pairs). In this paper, the role of reinforcement is investigated for early word-learning by an artificial agent. When exposed to a group of speakers, the agent comes to understand an initial set of vocabulary items belonging to the language used by the group. Both cross-situational learning and social pragmatic theory are taken into account. As social cues, joint attention and prosodic cues in the caregiver's speech are considered. During agent-caregiver interaction, the agent selects a word from the caregiver's utterance and learns the relations between that word and the objects in its visual environment. The “novel words to novel objects” language-specific constraint is assumed for computing rewards. The models are learned by maximizing the expected reward using reinforcement learning algorithms [i.e., table-based algorithms: Q-learning, SARSA, SARSA-λ, and neural network-based algorithms: Q-learning for neural network (Q-NN), neural-fitted Q-network (NFQ), and deep Q-network (DQN)]. Neural network-based reinforcement learning models are chosen over table-based models for better generalization and quicker convergence. Simulations are carried out using the mother-infant interaction CHILDES dataset for learning word-object pairings. Reinforcement is modeled in two cross-situational learning cases: (1) with joint attention (Attentional models), and (2) with joint attention and prosodic cues (Attentional-prosodic models). Attentional-prosodic models manifest superior performance to Attentional ones for the task of word-learning. The Attentional-prosodic DQN outperforms existing word-learning models for the same task. PMID:29441027
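For readers unfamiliar with the table-based algorithms named in this abstract, the following is a minimal sketch of a single tabular Q-learning update applied to a toy word-object pairing problem. It is not the authors' model: the three-word lexicon, the exhaustive sweep over candidate objects (used here in place of an exploration policy so the run is deterministic), the reward of 1 for a correct pairing, and the values of `alpha` and `gamma` are all assumptions made for illustration.

```python
from collections import defaultdict


def q_update(q, state, action, reward, next_state, actions, alpha=0.5, gamma=0.9):
    """One tabular Q-learning step:
    Q(s, a) += alpha * (r + gamma * max_a' Q(s', a') - Q(s, a)).
    A next_state of None marks a terminal step (no future value)."""
    best_next = 0.0 if next_state is None else max(q[(next_state, a)] for a in actions)
    q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])


# Hypothetical toy lexicon: the heard word is the state, the chosen object is the action.
truth = {"ball": "BALL", "dog": "DOG", "cup": "CUP"}
objects = sorted(truth.values())

q = defaultdict(float)  # all Q-values start at 0.0
for _ in range(2):  # two sweeps over every word-object combination
    for word, referent in truth.items():
        for obj in objects:
            reward = 1.0 if obj == referent else 0.0
            q_update(q, word, obj, reward, None, objects)  # single-step episodes

# Greedy read-out: for each word, pick the object with the highest Q-value.
learned = {w: max(objects, key=lambda o: q[(w, o)]) for w in truth}
```

Because every episode here ends after one step, the update reduces to moving each Q-value halfway toward its reward, so correct pairings rise toward 1 while incorrect ones stay at 0. SARSA differs only in bootstrapping from the action actually taken in the next state rather than from the maximum.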
Foot Structure in Japanese Speech Errors: Normal vs. Pathological
ERIC Educational Resources Information Center
Miyakoda, Haruko
2008-01-01
Although many studies of speech errors have been presented in the literature, most have focused on errors occurring at either the segmental or feature level. Few, if any, studies have dealt with the prosodic structure of errors. This paper aims to fill this gap by taking up the issue of prosodic structure in Japanese speech errors, with a focus on…
ERIC Educational Resources Information Center
Campfield, Dorota E.; Murphy, Victoria A.
2017-01-01
This paper reports on an intervention study with young Polish beginners (mean age: 8 years, 3 months) learning English at school. It seeks to identify whether exposure to rhythmic input improves knowledge of word order and function words. The "prosodic bootstrapping hypothesis", relevant in developmental psycholinguistics, provided the…
The Role of Prosodic Structure in the L2 Acquisition of Spanish Stop Lenition
ERIC Educational Resources Information Center
Cabrelli Amaro, Jennifer
2017-01-01
This study tests the hypothesis that late first-language English / second-language Spanish learners (L1 English / L2 Spanish learners) acquire spirantization in stages according to the prosodic hierarchy (Zampini, 1997, 1998). In Spanish, voiced stops [b d g] surface after a pause or nasal stop, and continuants [β̞ ð̞ ɣ̞] surface postvocalically,…
Harmonic Domains and Synchronization in Typically and Atypically Developing Hebrew-Speaking Children
ERIC Educational Resources Information Center
Bat-El, Outi
2009-01-01
This paper presents a comparative study of typical and atypical consonant harmony (onset-onset place harmony), with emphasis on (i) the size of the harmonic domain, (ii) the position of the harmonic domain within the prosodic word, and (iii) the maximal size of the prosodic word that exhibits consonant harmony. The data, drawn from typically and…
Cross-Linguistic Differences in Prosodic Cues to Syntactic Disambiguation in German and English
ERIC Educational Resources Information Center
O'Brien, Mary Grantham; Jackson, Carrie N.; Gardner, Christine E.
2014-01-01
This study examined whether late-learning English-German second language (L2) learners and late-learning German-English L2 learners use prosodic cues to disambiguate temporarily ambiguous first language and L2 sentences during speech production. Experiments 1a and 1b showed that English-German L2 learners and German-English L2 learners used a…
ERIC Educational Resources Information Center
Choe, Wook Kyung
2013-01-01
The current dissertation represents one of the first systematic studies of the distribution of speech errors within supralexical prosodic units. Four experiments were conducted to gain insight into the specific role of these units in speech planning and production. The first experiment focused on errors in adult English. These were found to be…
ERIC Educational Resources Information Center
Simmons, Elizabeth Schoen; Paul, Rhea; Shic, Frederick
2016-01-01
This study examined the acceptability of a mobile application, "SpeechPrompts," designed to treat prosodic disorders in children with ASD and other communication impairments. Ten speech-language pathologists (SLPs) in public schools and 40 of their students, 5-19 years with prosody deficits participated. Students received treatment with…
ERIC Educational Resources Information Center
Li, Jackie P. W.; Law, Thomas; Lam, Gary Y. H.; To, Carol K. S.
2013-01-01
English-speaking children with Autism Spectrum Disorders (ASD) are less capable of using prosodic cues such as intonation for irony comprehension. Prosodic cues, in particular intonation, in Cantonese are relatively restricted while sentence-final particles (SFPs) may be used for this pragmatic function. This study investigated the use of prosodic…
Ultrasound visual feedback treatment and practice variability for residual speech sound errors
Preston, Jonathan L.; McCabe, Patricia; Rivera-Campos, Ahmed; Whittle, Jessica L.; Landry, Erik; Maas, Edwin
2014-01-01
Purpose: The goals were to (1) test the efficacy of a motor-learning based treatment that includes ultrasound visual feedback for individuals with residual speech sound errors, and (2) explore whether the addition of prosodic cueing facilitates speech sound learning. Method: A multiple baseline single subject design was used, replicated across 8 participants. For each participant, one sound context was treated with ultrasound plus prosodic cueing for 7 sessions, and another sound context was treated with ultrasound but without prosodic cueing for 7 sessions. Sessions included ultrasound visual feedback as well as non-ultrasound treatment. Word-level probes assessing untreated words were used to evaluate retention and generalization. Results: For most participants, increases in accuracy of target sound contexts at the word level were observed with the treatment program regardless of whether prosodic cueing was included. Generalization between onset singletons and clusters was observed, as well as generalization to sentence-level accuracy. There was evidence of retention during post-treatment probes, including at a two-month follow-up. Conclusions: A motor-based treatment program that includes ultrasound visual feedback can facilitate learning of speech sounds in individuals with residual speech sound errors. PMID:25087938
Infant-Mother Acoustic-Prosodic Alignment and Developmental Risk.
Seidl, Amanda; Cristia, Alejandrina; Soderstrom, Melanie; Ko, Eon-Suk; Abel, Emily A; Kellerman, Ashleigh; Schwichtenberg, A J
2018-06-19
One promising early marker for autism and other communicative and language disorders is early infant speech production. Here we used daylong recordings of high- and low-risk infant-mother dyads to examine whether acoustic-prosodic alignment as well as two automated measures of infant vocalization are related to developmental risk status indexed via familial risk and developmental progress at 36 months of age. Automated analyses of the acoustics of daylong real-world interactions were used to examine whether pitch characteristics of one vocalization by the mother or the child predicted those of the vocalization response by the other speaker and whether other features of infants' speech in daylong recordings were associated with developmental risk status or outcomes. Low-risk and high-risk dyads did not differ in the level of acoustic-prosodic alignment, which was overall not significant. Further analyses revealed that acoustic-prosodic alignment did not predict infants' later developmental progress, which was, however, associated with two automated measures of infant vocalizations (daily vocalizations and conversational turns). Although further research is needed, these findings suggest that automated measures of vocalizations drawn from daylong recordings are a possible early identification tool for later developmental progress/concerns. https://osf.io/cdn3v/.
NASA Astrophysics Data System (ADS)
Kasyidi, Fatan; Puji Lestari, Dessi
2018-03-01
One important aspect of human-to-human communication is understanding the emotion of each party. Recently, interaction between humans and computers has continued to develop, especially affective interaction, in which emotion recognition is an important component. This paper presents our extended work on emotion recognition in spoken Indonesian, identifying four main classes of emotion (Happy, Sad, Angry, and Contentment) using a combination of acoustic/prosodic features and lexical features. We construct an emotional speech corpus from Indonesian television talk shows, where the situations are as close as possible to natural ones. After constructing the corpus, the acoustic/prosodic and lexical features are extracted to train the emotion model. We employ machine learning algorithms such as Support Vector Machine (SVM), Naive Bayes, and Random Forest to find the best model. On the test data, the best model, an SVM with an RBF kernel, achieves an F-measure of 0.447 using only the acoustic/prosodic features and an F-measure of 0.488 using both acoustic/prosodic and lexical features for four-class emotion recognition.
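As a reading aid for the F-measure scores quoted in this abstract, here is a minimal sketch of how a per-class F-measure and its unweighted (macro) average can be computed for a four-class problem. The abstract does not say which averaging scheme was used, so macro averaging is an assumption, and the labels and predictions below are invented for illustration rather than taken from the corpus.

```python
def f_measure(y_true, y_pred, label):
    """Harmonic mean of precision and recall for one class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == label and p == label for t, p in pairs)
    fp = sum(t != label and p == label for t, p in pairs)
    fn = sum(t == label and p != label for t, p in pairs)
    if tp == 0:
        return 0.0  # no correct predictions for this class
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def macro_f_measure(y_true, y_pred):
    """Unweighted mean of the per-class F-measures (assumed averaging scheme)."""
    labels = sorted(set(y_true))
    return sum(f_measure(y_true, y_pred, lab) for lab in labels) / len(labels)


# Hypothetical utterance labels; not data from the study.
y_true = ["happy", "happy", "sad", "sad", "angry", "angry", "content", "content"]
y_pred = ["happy", "sad", "sad", "sad", "angry", "happy", "content", "angry"]

sad_f = f_measure(y_true, y_pred, "sad")
macro_f = macro_f_measure(y_true, y_pred)
```

In this toy case the "sad" class has precision 2/3 and recall 1, giving an F-measure of 0.8. The macro average weights each emotion equally, which matters when some classes are rare in a natural corpus.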
Teaching tone and intonation with the Prosody Workstation using schematic versus veridical contours
NASA Astrophysics Data System (ADS)
Allen, George D.; Eulenberg, John B.
2004-05-01
Prosodic features of speech (e.g., intonation and rhythm) are often challenging for adults to learn. Most computerized teaching tools, developed to help learners mimic model prosodic patterns, display lines representing the veridical (actual) acoustic fundamental frequency and intensity of the model speech. However, a veridical display may not be optimal for this task. Instead, stereotypical representations (e.g., simplified level or slanting lines) may help by reducing the amount of potentially distracting information. The Prosody Workstation (PW) permits the prosodic contours of both models and users' responses to be displayed using either veridical or stereotypical contours. Users are informed by both visual displays and scores representing the degree of match of their utterance to the model. American English-speaking undergraduates are being studied learning the tone contours and rhythm of Chinese and Hausa utterances ranging in length from two to six syllables. Data include (a) accuracy of mimicking of the models' prosodic contours, measured by the PW; (b) quality of tonal and rhythmic production, judged by native speaker listeners; and (c) learners' perceptions of the ease of the task, measured by a questionnaire at the end of each session.
The relationship between form and function level receptive prosodic abilities in autism.
Järvinen-Pasley, Anna; Peppé, Susan; King-Smith, Gavin; Heaton, Pamela
2008-08-01
Prosody can be conceived as having form (auditory-perceptual characteristics) and function (pragmatic/linguistic meaning). No known studies have examined the relationship between form- and function-level prosodic skills in relation to the effects of stimulus length and/or complexity upon such abilities in autism. Research in this area is both insubstantial and inconclusive. Children with autism and controls completed the receptive tasks of the Profiling Elements of Prosodic Systems in Children (PEPS-C) test, which examines both form- and function-level skills, and a sentence-level task assessing the understanding of intonation. While children with autism were unimpaired in both form and function tasks at the single-word level, they showed significantly poorer performance in the corresponding sentence-level tasks than controls. Implications for future research are discussed.
NASA Astrophysics Data System (ADS)
Imai, Emiko; Katagiri, Yoshitada; Seki, Keiko; Kawamata, Toshio
2011-06-01
We present a neural model of the production of modulated speech streams in the brain, referred to as prosody, which indicates the limbic structure essential for producing prosody both linguistically and emotionally. This model suggests that activating the fundamental brain including monoamine neurons at the basal ganglia will potentially contribute to helping patients with prosodic disorders coming from functional defects of the fundamental brain to overcome their speech problem. To establish effective clinical treatment for such prosodic disorders, we examine how sounds affect the fundamental activity by using electroencephalographic measurements. Throughout examinations with various melodious sounds, we found that some melodies with lilting rhythms successfully give rise to the fast alpha rhythms at the electroencephalogram which reflect the fundamental brain activity without any negative feelings.
Tone and prosodic organization in Cherokee nouns
NASA Astrophysics Data System (ADS)
Johnson, Keith; Haag, Marcia
2005-04-01
Preliminary observations in the speech of one speaker of Cherokee led us to postulate three factors affecting tone in Cherokee. (1) Tone may be lexically specified with distinctive low, low fall, low rise, and high tones. (2) There is a metrically determined high fall pattern which may be distributed over not more than 2 syllables from the right edge of a prosodic domain. (3) Intonational domains may be associated with discourse functions, marked by high fall, or by pitch range upstep. This paper tests these observations in recordings of word lists and sentences produced by five additional speakers. The analysis we give, positing both lexical tone and metrical prosodic accent, is not unique in descriptions of language, but is different from the usual description of Cherokee. [Work supported by NSF.]
Sheppard, Shannon M; Love, Tracy; Midgley, Katherine J; Holcomb, Phillip J; Shapiro, Lewis P
2017-12-01
Event-related potentials (ERPs) were used to examine how individuals with aphasia and a group of age-matched controls use prosody and thematic fit information in sentences containing temporary syntactic ambiguities. Two groups of individuals with aphasia were investigated: those demonstrating relatively good sentence comprehension whose primary language difficulty is anomia (Individuals with Anomic Aphasia (IWAA)), and those who demonstrate impaired sentence comprehension whose primary diagnosis is Broca's aphasia (Individuals with Broca's Aphasia (IWBA)). The stimuli had early closure syntactic structure and contained a temporary early closure (correct)/late closure (incorrect) syntactic ambiguity. The prosody was manipulated to either be congruent or incongruent, and the temporarily ambiguous NP was also manipulated to either be a plausible or an implausible continuation for the subordinate verb (e.g., "While the band played the song/the beer pleased all the customers."). It was hypothesized that an implausible NP in sentences with incongruent prosody may provide the parser with a plausibility cue that could be used to predict syntactic structure. The results revealed that incongruent prosody paired with a plausibility cue resulted in an N400-P600 complex at the implausible NP (the beer) in both the controls and the IWAAs, yet incongruent prosody without a plausibility cue resulted in an N400-P600 at the critical verb (pleased) only in healthy controls. IWBAs did not show evidence of N400 or P600 effects at the ambiguous NP or critical verb, although they did show evidence of a delayed N400 effect at the sentence-final word in sentences with incongruent prosody. These results suggest that IWAAs have difficulty integrating prosodic cues with underlying syntactic structure when lexical-semantic information is not available to aid their parse.
IWBAs have difficulty integrating both prosodic and lexical-semantic cues with syntactic structure, likely due to a processing delay. Copyright © 2017 Elsevier Ltd. All rights reserved.
The coordination of boundary tones and its interaction with prominence.
Katsika, Argyro; Krivokapić, Jelena; Mooshammer, Christine; Tiede, Mark; Goldstein, Louis
2014-05-01
This study investigates the coordination of boundary tones as a function of stress and pitch accent. Boundary tone coordination has not been experimentally investigated previously, and the effect of prominence on this coordination, and whether it is lexical (stress-driven) or phrasal (pitch accent-driven) in nature is unclear. We assess these issues using a variety of syntactic constructions to elicit different boundary tones in an Electromagnetic Articulography (EMA) study of Greek. The results indicate that the onset of boundary tones co-occurs with the articulatory target of the final vowel. This timing is further modified by stress, but not by pitch accent: boundary tones are initiated earlier in words with non-final stress than in words with final stress regardless of accentual status. Visual data inspection reveals that phrase-final words are followed by acoustic pauses during which specific articulatory postures occur. Additional analyses show that these postures reach their achievement point at a stable temporal distance from boundary tone onsets regardless of stress position. Based on these results and parallel findings on boundary lengthening reported elsewhere, a novel approach to prosody is proposed within the context of Articulatory Phonology: rather than seeing prosodic (lexical and phrasal) events as independent entities, a set of coordination relations between them is suggested. The implications of this account for prosodic architecture are discussed.
ERIC Educational Resources Information Center
Ormel, Ellen; Crasborn, Onno
2012-01-01
This article contains a literature review of evidence of large prosodic domains that correspond to syntactic units such as a clause or a sentence. In particular, different phonetic nonmanual cues that may relate to clause or sentence boundaries are discussed in detail. On the basis of various ideas and views in the literature, we also describe two…
ERIC Educational Resources Information Center
Fredman, Traci
2017-01-01
Clinical Question: For children ages birth to 3 years diagnosed with a language delay or disorder, to what extent does the prosodic component of motherese aid in establishing joint attention (JA)? Method: Systematic Review. Study Sources: ASHA, Web of Science, CINAHL, MEDLINE, EBSCO, PubMed, PsycINFO, and ERIC. Search Terms: motherese, infant…
ERIC Educational Resources Information Center
Kariuki, Patrick; Baxter, Andrew
2011-01-01
The purpose of this study was to examine the relationship between prosodic oral reading and overall reading comprehension of second grade students. The sample consisted of ten students who were randomly selected from a second grade classroom. The students read aloud from the book "Bedtime for Frances" by Lillian and Russell Hoban for a…
Analysis of engagement behavior in children during dyadic interactions using prosodic cues
Gupta, Rahul; Bone, Daniel; Lee, Sungbok; Narayanan, Shrikanth
2017-01-01
Child engagement is defined as the interaction of a child with his/her environment in a contextually appropriate manner. Engagement behavior in children is linked to socio-emotional and cognitive state assessment with enhanced engagement identified with improved skills. A vast majority of studies however rely solely, and often implicitly, on subjective perceptual measures of engagement. Access to automatic quantification could assist researchers/clinicians to objectively interpret engagement with respect to a target behavior or condition, and furthermore inform mechanisms for improving engagement in various settings. In this paper, we present an engagement prediction system based exclusively on vocal cues observed during structured interaction between a child and a psychologist involving several tasks. Specifically, we derive prosodic cues that capture engagement levels across the various tasks. Our experiments suggest that a child’s engagement is reflected not only in the vocalizations, but also in the speech of the interacting psychologist. Moreover, we show that prosodic cues are informative of the engagement phenomena not only as characterized over the entire task (i.e., global cues), but also in short term patterns (i.e., local cues). We perform a classification experiment assigning the engagement of a child into three discrete levels achieving an unweighted average recall of 55.8% (chance is 33.3%). While the systems using global cues and local level cues are each statistically significant in predicting engagement, we obtain the best results after fusing these two components. We perform further analysis of the cues at local and global levels to achieve insights linking specific prosodic patterns to the engagement phenomenon. We observe that while the performance of our model varies with task setting and interacting psychologist, there exist universal prosodic patterns reflective of engagement. PMID:28713198
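The metric reported in this abstract, unweighted average recall (UAR), is the mean of the per-class recalls, which is why a three-class task has a chance level of 33.3% regardless of class imbalance. The sketch below illustrates the computation under stated assumptions: the engagement levels are coded 0, 1 and 2, and the label sequences are made up for the example rather than drawn from the study.

```python
from collections import defaultdict


def unweighted_average_recall(y_true, y_pred):
    """Mean of per-class recalls; each class counts equally whatever its size."""
    total = defaultdict(int)
    correct = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        if t == p:
            correct[t] += 1
    return sum(correct[c] / total[c] for c in total) / len(total)


# Hypothetical labels for three engagement levels; not data from the study.
y_true = [0, 0, 0, 0, 1, 1, 2, 2, 2, 2]
y_pred = [0, 0, 1, 2, 1, 0, 2, 2, 2, 1]

uar = unweighted_average_recall(y_true, y_pred)

# A classifier that always answers the same class scores exactly 1/3 on three classes.
constant_uar = unweighted_average_recall(y_true, [0] * len(y_true))
```

Here the per-class recalls are 0.5, 0.5 and 0.75, so the UAR is about 0.583. The constant predictor gets recall 1 on one class and 0 on the other two, which reproduces the 33.3% chance level.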
Irurtzun, Aritz
2015-01-01
In recent research, Boeckx and Benítez-Burraco (2014a,b) have advanced the hypothesis that our species-specific language-ready brain should be understood as the outcome of developmental changes that occurred in our species after the split from Neanderthals-Denisovans, which resulted in a more globular braincase configuration in comparison to our closest relatives, who had elongated endocasts. According to these authors, the development of a globular brain is an essential ingredient for the language faculty and in particular, it is the centrality occupied by the thalamus in a globular brain that allows its modulatory or regulatory role, essential for syntactico-semantic computations. Their hypothesis is that the syntactico-semantic capacities arise in humans as a consequence of a process of globularization, which significantly takes place postnatally (cf. Neubauer et al., 2010). In this paper, I show that Boeckx and Benítez-Burraco's hypothesis makes an interesting developmental prediction regarding the path of language acquisition: it teases apart the onset of phonological acquisition and the onset of syntactic acquisition (the latter starting significantly later, after globularization). I argue that this hypothesis provides a developmental rationale for the prosodic bootstrapping hypothesis of language acquisition (cf. i.a. Gleitman and Wanner, 1982; Mehler et al., 1988, et seq.; Gervain and Werker, 2013), which claims that prosodic cues are employed for syntactic parsing. The literature converges in the observation that a large number of such prosodic cues (in particular, rhythmic cues) are already acquired before the completion of the globularization phase, which paves the way for the premises of the prosodic bootstrapping hypothesis, allowing babies to have a rich knowledge of the prosody of their target language before they can start parsing the primary linguistic data syntactically.
Analysis of engagement behavior in children during dyadic interactions using prosodic cues.
Gupta, Rahul; Bone, Daniel; Lee, Sungbok; Narayanan, Shrikanth
2016-05-01
Child engagement is defined as the interaction of a child with his/her environment in a contextually appropriate manner. Engagement behavior in children is linked to socio-emotional and cognitive state assessment with enhanced engagement identified with improved skills. A vast majority of studies however rely solely, and often implicitly, on subjective perceptual measures of engagement. Access to automatic quantification could assist researchers/clinicians to objectively interpret engagement with respect to a target behavior or condition, and furthermore inform mechanisms for improving engagement in various settings. In this paper, we present an engagement prediction system based exclusively on vocal cues observed during structured interaction between a child and a psychologist involving several tasks. Specifically, we derive prosodic cues that capture engagement levels across the various tasks. Our experiments suggest that a child's engagement is reflected not only in the vocalizations, but also in the speech of the interacting psychologist. Moreover, we show that prosodic cues are informative of the engagement phenomena not only as characterized over the entire task (i.e., global cues), but also in short term patterns (i.e., local cues). We perform a classification experiment assigning the engagement of a child into three discrete levels achieving an unweighted average recall of 55.8% (chance is 33.3%). While the systems using global cues and local level cues are each statistically significant in predicting engagement, we obtain the best results after fusing these two components. We perform further analysis of the cues at local and global levels to achieve insights linking specific prosodic patterns to the engagement phenomenon. We observe that while the performance of our model varies with task setting and interacting psychologist, there exist universal prosodic patterns reflective of engagement.
Cumming, Ruth; Wilson, Angela; Goswami, Usha
2015-01-01
Children with specific language impairments (SLIs) show impaired perception and production of spoken language, and can also present with motor, auditory, and phonological difficulties. Recent auditory studies have shown impaired sensitivity to amplitude rise time (ART) in children with SLIs, along with non-speech rhythmic timing difficulties. Linguistically, these perceptual impairments should affect sensitivity to speech prosody and syllable stress. Here we used two tasks requiring sensitivity to prosodic structure, the DeeDee task and a stress misperception task, to investigate this hypothesis. We also measured auditory processing of ART, rising pitch and sound duration, in both speech (“ba”) and non-speech (tone) stimuli. Participants were 45 children with SLI aged on average 9 years and 50 age-matched controls. We report data for all the SLI children (N = 45, IQ varying), as well as for two independent SLI subgroupings with intact IQ. One subgroup, “Pure SLI,” had intact phonology and reading (N = 16), the other, “SLI PPR” (N = 15), had impaired phonology and reading. Problems with syllable stress and prosodic structure were found for all the group comparisons. Both sub-groups with intact IQ showed reduced sensitivity to ART in speech stimuli, but the PPR subgroup also showed reduced sensitivity to sound duration in speech stimuli. Individual differences in processing syllable stress were associated with auditory processing. These data support a new hypothesis, the “prosodic phrasing” hypothesis, which proposes that grammatical difficulties in SLI may reflect perceptual difficulties with global prosodic structure related to auditory impairments in processing amplitude rise time and duration. PMID:26217286
Menninghaus, Winfried; Bohrn, Isabel C; Knoop, Christine A; Kotz, Sonja A; Schlotz, Wolff; Jacobs, Arthur M
2015-10-01
Studies on rhetorical features of language have reported both enhancing and adverse effects on ease of processing. We hypothesized that two explanations may account for these inconclusive findings. First, the respective gains and losses in ease of processing may apply to different dimensions of language processing (specifically, prosodic and semantic processing) and different types of fluency (perceptual vs. conceptual) and may well allow for an integration into a more comprehensive framework. Second, the effects of rhetorical features may be sensitive to interactions with other rhetorical features; employing a feature separately or in combination with others may then predict starkly different effects. We designed a series of experiments in which we expected the same rhetorical features of the very same sentences to exert adverse effects on semantic (conceptual) fluency and enhancing effects on prosodic (perceptual) fluency. We focused on proverbs that each employ three rhetorical features: rhyme, meter, and brevitas (i.e., artful shortness). The presence of these target features decreased ease of conceptual fluency (semantic comprehension) while enhancing perceptual fluency as reflected in beauty and succinctness ratings that were mainly driven by prosodic features. The rhetorical features also predicted choices for persuasive purposes, yet only for the sentence versions featuring all three rhetorical features; the presence of only one or two rhetorical features had an adverse effect on the choices made. We suggest that the facilitating effects of a combination of rhyme, meter, and rhetorical brevitas on perceptual (prosodic) fluency overcompensated for their adverse effects on conceptual (semantic) fluency, thus resulting in a total net gain both in processing ease and in choices for persuasive purposes. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.
Simmons, Elizabeth Schoen; Paul, Rhea; Shic, Frederick
2016-01-01
This study examined the acceptability of a mobile application, SpeechPrompts, designed to treat prosodic disorders in children with ASD and other communication impairments. Ten speech-language pathologists (SLPs) in public schools and 40 of their students, aged 5-19 years with prosody deficits, participated. Students received treatment with the software over eight weeks. Pre- and post-treatment speech samples and student engagement data were collected. Feedback on the utility of the software was also obtained. SLPs implemented the software with their students in an authentic education setting. Student engagement ratings indicated students' attention to the software was maintained during treatment. Although more testing is warranted, post-treatment prosody ratings suggest that SpeechPrompts has potential to be a useful tool in the treatment of prosodic disorders.
NASA Astrophysics Data System (ADS)
de Jong, Kenneth; Silbert, Noah; Park, Hanyong
2004-05-01
Experimental models of cross-language perception and second-language acquisition (such as PAM and SLM) typically treat language differences in terms of whether the two languages share phonological segmental categories. Linguistic models, by contrast, generally examine properties which cross classify segments, such as features, rules, or prosodic constraints. Such models predict that perceptual patterns found for one segment will generalize to other segments of the same class. This paper presents perceptual identifications of Korean listeners to a set of voiced and voiceless English stops and fricatives in various prosodic locations to determine the extent to which such generality occurs. Results show some class-general effects; for example, voicing identification patterns generalize from stops, which occur in Korean, to nonsibilant fricatives, which are new to Korean listeners. However, when identification is poor, there are clear differences between segments within the same class. For example, in identifying stops and fricatives, both point of articulation and prosodic position bias perceptions; coronals are more often labeled fricatives, and syllable initial obstruents are more often labeled stops. These results suggest that class-general perceptual patterns are not a simple consequence of the structure of the perceptual system, but need to be acquired by factoring out within-class differences.
ERIC Educational Resources Information Center
Gabriel, Christoph; Kireva, Elena
2014-01-01
A remarkable example of Spanish-Italian contact is the Spanish variety spoken in Buenos Aires (Porteño), which is said to be prosodically "Italianized" due to migration-induced contact. The change in Porteño prosody has been interpreted as a result of transfer from the first language (L1) that occurred when Italian immigrants learned…
ERIC Educational Resources Information Center
Braarud, Hanne Cecilie; Stormark, Kjell Morten
2008-01-01
The purpose of this study was to examine 32 mothers' sensitivity to social contingency during face-to-face interaction with their two- to four-month-old infants in a closed circuit TV set-up. Prosodic qualities and vocal sounds in mother's infant-directed (ID) speech during sequences of live interaction were compared to sequences where expressive…
Ferré, Perrine; Ska, Bernadette; Lajoie, Camille; Bleau, Amélie; Joanette, Yves
2011-01-01
Researchers and clinicians acknowledge today that the contribution of both cerebral hemispheres is necessary for full and adequate verbal communication. Indeed, it is estimated that at least 50% of right brain damaged individuals display impairments of prosodic, discourse, pragmatics and/or lexical semantics dimensions of communication. Since the 1990s, researchers have focused on the description and the assessment of these impairments and it is only recently that authors have shown interest in planning specific intervention approaches. However, therapists in rehabilitation settings still have very few available tools. This review of recent literature demonstrates that, even though theoretical knowledge needs further methodological investigation, intervention guidelines can be identified to target right hemisphere damage communication impairments in clinical practice. These principles can be incorporated by speech and language pathologists, in a structured intervention framework, aiming at fully addressing prosodic, discursive and pragmatic components of communication. PMID:22110970
Prosodic alignment in human-computer interaction
NASA Astrophysics Data System (ADS)
Suzuki, N.; Katagiri, Y.
2007-06-01
Androids that replicate humans in form also need to replicate them in behaviour to achieve a high level of believability or lifelikeness. We explore the minimal social cues that can induce in people the human tendency for social acceptance, or ethopoeia, toward artifacts, including androids. It has been observed that people exhibit a strong tendency to adjust to each other, through a number of speech and language features in human-human conversational interactions, to obtain communication efficiency and emotional engagement. We investigate in this paper the phenomena related to prosodic alignment in human-computer interactions, with particular focus on human-computer alignment of speech characteristics. We found that people exhibit unidirectional and spontaneous short-term alignment of loudness and response latency in their speech in response to computer-generated speech. We believe this phenomenon of prosodic alignment provides one of the key components for building social acceptance of androids.
Domahs, Ulrike; Knaus, Johannes A.; El Shanawany, Heba; Wiese, Richard
2014-01-01
This article presents neurolinguistic data on word stress perception in Cairene Arabic, in comparison to previous results on German and Turkish. The main goal is to investigate how central properties of stress systems such as predictability of stress and metrical structure are reflected in the prosodic processing of words. Cairene Arabic is a language with a regular foot-based word stress system, leading to highly predictable placement of word stress. An ERP study on Cairene Arabic is reported, in which a stress violation paradigm is used to investigate the factors predictability of stress and foot structure. The results of the experiment show that for Cairene Arabic the internal structure of prosodic words in terms of feet determines prosodic processing. This structure effect is complemented by a frequency effect for stress patterns. PMID:25374546
Does incongruence of lexicosemantic and prosodic information cause discernible cognitive conflict?
Mitchell, Rachel L C
2006-12-01
We are often required to interpret discordant emotional signals. Whereas equivalent cognitive paradigms cause noticeable conflict via their behavioral and psychophysiological effects, the same may not necessarily be true for discordant emotions. Skin conductance responses (SCRs) and heart rates (HRs) were measured during a classic Stroop task and one in which the emotions conveyed by lexicosemantic content and prosody were congruent or incongruent. The participants' task was to identify the emotion conveyed by lexicosemantic content or prosody. No relationship was observed between HR and congruence. SCR was higher during incongruent than during congruent conditions of the experimental task (as well as in the classic Stroop task), but no difference in SCR was observed in a comparison between congruence effects during lexicosemantic emotion identification and those during prosodic emotion identification. It is concluded that incongruence between lexicosemantic and prosodic emotion does cause notable cognitive conflict. Functional neuroanatomic implications are discussed.
Prosodic disambiguation of noun/verb homophones in child-directed speech.
Conwell, Erin
2017-05-01
One strategy that children might use to sort words into grammatical categories such as noun and verb is distributional bootstrapping, in which local co-occurrence information is used to distinguish between categories. Words that can be used in more than one grammatical category could be problematic for this approach. Using naturalistic corpus data, this study asks whether noun and verb uses of ambiguous words might differ prosodically as a function of their grammatical category in child-directed speech. The results show that noun and verb uses of ambiguous words in sentence-medial positions do differ from one another in terms of duration, vowel duration, pitch change, and vowel quality measures. However, sentence-final tokens are not different as a function of the category in which they were used. The availability of prosodic cues to category in natural child-directed speech could allow learners using a distributional bootstrapping approach to avoid conflating grammatical categories.
Tong, Xiuli; He, Xinjie; Deacon, S Hélène
2017-02-01
Languages differ considerably in how they use prosodic features, or variations in pitch, duration, and intensity, to distinguish one word from another. Prosodic features include lexical tone in Chinese and lexical stress in English. Recent cross-sectional studies show a surprising result that Mandarin Chinese tone sensitivity is related to Mandarin-English bilingual children's English word reading. This study explores the mechanism underlying this relation by testing two explanations of these effects: the prosodic hypothesis and segmental phonological awareness transfer. We administered multiple measures of Cantonese tone sensitivity, English stress sensitivity, segmental phonological awareness in Cantonese and English, nonverbal ability, and English word reading to 123 Cantonese-English bilingual children ages 7 and 8 years. Structural equation modeling revealed a longitudinal prediction of Cantonese tone sensitivity to English word reading between 8 and 9 years of age. This relation was realized through two parallel routes. In one, Cantonese tone sensitivity predicted English stress sensitivity, and English stress sensitivity, in turn, significantly predicted English word reading, as postulated by the prosodic hypothesis. In the second, Cantonese tone sensitivity predicted English word reading through the transfer of segmental phonological awareness between Cantonese and English, as predicted by segmental phonological transfer. These results support a unified model of phonological transfer, emphasizing the role of tone in English word reading for Cantonese-English bilingual children.
The coordination of boundary tones and its interaction with prominence
Katsika, Argyro; Krivokapić, Jelena; Mooshammer, Christine; Tiede, Mark; Goldstein, Louis
2014-01-01
This study investigates the coordination of boundary tones as a function of stress and pitch accent. Boundary tone coordination has not been experimentally investigated previously, and the effect of prominence on this coordination, and whether it is lexical (stress-driven) or phrasal (pitch accent-driven) in nature is unclear. We assess these issues using a variety of syntactic constructions to elicit different boundary tones in an Electromagnetic Articulography (EMA) study of Greek. The results indicate that the onset of boundary tones co-occurs with the articulatory target of the final vowel. This timing is further modified by stress, but not by pitch accent: boundary tones are initiated earlier in words with non-final stress than in words with final stress regardless of accentual status. Visual data inspection reveals that phrase-final words are followed by acoustic pauses during which specific articulatory postures occur. Additional analyses show that these postures reach their achievement point at a stable temporal distance from boundary tone onsets regardless of stress position. Based on these results and parallel findings on boundary lengthening reported elsewhere, a novel approach to prosody is proposed within the context of Articulatory Phonology: rather than seeing prosodic (lexical and phrasal) events as independent entities, a set of coordination relations between them is suggested. The implications of this account for prosodic architecture are discussed. PMID:25300341
Weber-Fox, Christine; Hart, Laura J; Spruill, John E
2006-07-01
This study examined how school-aged children process different grammatical categories. Event-related brain potentials elicited by words in visually presented sentences were analyzed according to seven grammatical categories with naturally varying characteristics of linguistic functions, semantic features, and quantitative attributes of length and frequency. The categories included nouns, adjectives, verbs, pronouns, conjunctions, prepositions, and articles. The findings indicate that by the age of 9-10 years, children exhibit robust neural indicators differentiating grammatical categories; however, it is also evident that development of language processing is not yet adult-like at this age. The current findings are consistent with the hypothesis that for beginning readers a variety of cues and characteristics interact to affect processing of different grammatical categories and indicate the need to take into account linguistic functions, prosodic salience, and grammatical complexity as they relate to the development of language abilities.
[Prosody, speech input and language acquisition].
Jungheim, M; Miller, S; Kühn, D; Ptok, M
2014-04-01
In order to acquire language, children require speech input. The prosody of the speech input plays an important role. In most cultures adults modify their code when communicating with children. Compared to normal speech this code differs especially with regard to prosody. For this review a selective literature search in PubMed and Scopus was performed. Prosodic characteristics are a key feature of spoken language. By analysing prosodic features, children gain knowledge about underlying grammatical structures. Child-directed speech (CDS) is modified in a way that meaningful sequences are highlighted acoustically so that important information can be extracted from the continuous speech flow more easily. CDS is said to enhance the representation of linguistic signs. Taking into consideration what has previously been described in the literature regarding the perception of suprasegmentals, CDS seems to be able to support language acquisition due to the correspondence of prosodic and syntactic units. However, no findings have been reported stating that the linguistically reduced CDS could hinder first language acquisition.
The assessment and treatment of prosodic disorders and neurological theories of prosody.
Diehl, Joshua J; Paul, Rhea
2009-08-01
In this article, we comment on specific aspects of Peppé (Peppé, 2009). In particular, we address the assessment and treatment of prosody in clinical settings and discuss current theory on neurological models of prosody. We argue that in order for prosodic assessment instruments and treatment programs to be clinically effective, we need assessment instruments that: (1) have a representative normative comparison sample and strong psychometric properties; (2) are based on empirical information regarding the typical sequence of prosodic acquisition and are sensitive to developmental change; (3) meaningfully subcategorize various aspects of prosody; (4) use tasks that have ecological validity; and (5) have clinical properties, such as length and ease of administration, that allow them to become part of standard language assessment batteries. In addition, we argue that current theories of prosody processing in the brain are moving toward network models that involve multiple brain areas and are crucially dependent on cortical communication. The implications of these observations for future research and clinical practice are outlined.
Guidi, Andrea; Salvi, Sergio; Ottaviano, Manuel; Gentili, Claudio; Bertschy, Gilles; de Rossi, Danilo; Scilingo, Enzo Pasquale; Vanello, Nicola
2015-11-06
Bipolar disorder is one of the most common mood disorders characterized by large and invalidating mood swings. Several projects focus on the development of decision support systems that monitor and advise patients, as well as clinicians. Voice monitoring and speech signal analysis can be exploited to reach this goal. In this study, an Android application was designed for analyzing running speech using a smartphone device. The application can record audio samples and estimate speech fundamental frequency, F0, and its changes. F0-related features are estimated locally on the smartphone, with some advantages with respect to remote processing approaches in terms of privacy protection and reduced upload costs. The raw features can be sent to a central server and further processed. The quality of the audio recordings, algorithm reliability and performance of the overall system were evaluated in terms of voiced segment detection and features estimation. The results demonstrate that mean F0 from each voiced segment can be reliably estimated, thus describing prosodic features across the speech sample. Instead, features related to F0 variability within each voiced segment performed poorly. A case study performed on a bipolar patient is presented. PMID:26561811
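The abstract above does not say which algorithm the application uses to estimate F0. As an illustration of what estimating fundamental frequency from a short voiced frame can involve, here is a minimal sketch using a textbook autocorrelation peak search, which is a stand-in technique and not the authors' method. The function name, the 75-400 Hz search range, and the synthetic 200 Hz test tone are all assumptions made for this example.

```python
import math

def estimate_f0(frame, sample_rate, f0_min=75.0, f0_max=400.0):
    """Return the F0 (Hz) whose lag maximizes the frame's autocorrelation."""
    lag_min = int(sample_rate / f0_max)  # shortest period considered
    lag_max = int(sample_rate / f0_min)  # longest period considered
    best_lag, best_score = lag_min, float("-inf")
    for lag in range(lag_min, lag_max + 1):
        # Unnormalized autocorrelation of the frame at this lag.
        score = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag

# Synthetic "voiced" frame: a pure 200 Hz tone, 100 ms at 8 kHz (assumed values).
sample_rate = 8000
frame = [math.sin(2 * math.pi * 200.0 * n / sample_rate) for n in range(800)]
f0 = estimate_f0(frame, sample_rate)
```

Averaging such frame-level estimates within a voiced segment would give a per-segment mean F0 of the kind the abstract describes. Real speech would additionally need voicing detection and protection against octave errors, which this sketch leaves out.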
Early sensory encoding of affective prosody: neuromagnetic tomography of emotional category changes.
Thönnessen, Heike; Boers, Frank; Dammers, Jürgen; Chen, Yu-Han; Norra, Christine; Mathiak, Klaus
2010-03-01
In verbal communication, prosodic codes may be phylogenetically older than lexical ones. Little is known, however, about early, automatic encoding of emotional prosody. This study investigated the neuromagnetic analogue of mismatch negativity (MMN) as an index of early stimulus processing of emotional prosody using whole-head magnetoencephalography (MEG). We applied two different paradigms to study MMN; in addition to the traditional oddball paradigm, the so-called optimum design was adapted to emotion detection. In a sequence of randomly changing disyllabic pseudo-words produced by one male speaker in neutral intonation, a traditional oddball design with emotional deviants (10% happy and angry each) and an optimum design with emotional (17% happy and sad each) and nonemotional gender deviants (17% female) elicited the mismatch responses. The emotional category changes demonstrated early responses (<200 ms) at both auditory cortices with larger amplitudes at the right hemisphere. Responses to the nonemotional change from male to female voices emerged later (approximately 300 ms). Source analysis pointed at bilateral auditory cortex sources without robust contribution from other sources, such as frontal ones. Conceivably, both auditory cortices encode categorical representations of emotional prosody. Processing of cognitive feature extraction and automatic emotion appraisal may overlap at this level, enabling rapid attentional shifts to important social cues. Copyright (c) 2009 Elsevier Inc. All rights reserved.
Detection of Clinical Depression in Adolescents’ Speech During Family Interactions
Low, Lu-Shih Alex; Maddage, Namunu C.; Lech, Margaret; Sheeber, Lisa B.; Allen, Nicholas B.
2013-01-01
The properties of acoustic speech have previously been investigated as possible cues for depression in adults. However, these studies were restricted to small populations of patients and the speech recordings were made during patients’ clinical interviews or fixed-text reading sessions. Symptoms of depression often first appear during adolescence at a time when the voice is changing, in both males and females, suggesting that specific studies of these phenomena in adolescent populations are warranted. This study investigated acoustic correlates of depression in a large sample of 139 adolescents (68 clinically depressed and 71 controls). Speech recordings were made during naturalistic interactions between adolescents and their parents. Prosodic, cepstral, spectral, and glottal features, as well as features derived from the Teager energy operator (TEO), were tested within a binary classification framework. Strong gender differences in classification accuracy were observed. The TEO-based features clearly outperformed all other features and feature combinations, providing classification accuracy ranging between 81%–87% for males and 72%–79% for females. Close, but slightly less accurate, results were obtained by combining glottal features with prosodic and spectral features (67%–69% for males and 70%–75% for females). These findings indicate the importance of nonlinear mechanisms associated with the glottal flow formation as cues for clinical depression. PMID:21075715
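The abstract above does not spell out how its TEO-based features are computed. For orientation, the sketch below shows the standard discrete form of the Teager energy operator, psi[n] = x[n]^2 - x[n-1] * x[n+1]. Using this particular form is an assumption here, and the function name and test signal are hypothetical.

```python
import math

def teager_energy(x):
    """Discrete Teager energy operator: psi[n] = x[n]^2 - x[n-1] * x[n+1]."""
    return [x[n] ** 2 - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]

# Hypothetical test signal: a pure cosine with assumed amplitude and
# digital frequency (radians per sample).
amplitude, omega = 2.0, 0.3
signal = [amplitude * math.cos(omega * n) for n in range(50)]
psi = teager_energy(signal)

# For a pure sinusoid the operator output is constant: A^2 * sin^2(omega).
expected = amplitude ** 2 * math.sin(omega) ** 2
```

The output depends on both amplitude and frequency, so the operator responds to changes in either. That sensitivity is one reason it is used to probe nonlinear aspects of speech production.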
The Acquisition of English Focus Marking by Non-Native Speakers
NASA Astrophysics Data System (ADS)
Baker, Rachel Elizabeth
This dissertation examines Mandarin and Korean speakers' acquisition of English focus marking, which is realized by accenting particular words within a focused constituent. It is important for non-native speakers to learn how accent placement relates to focus in English because appropriate accent placement and realization makes a learner's English more native-like and easier to understand. Such knowledge may also improve their English comprehension skills. In this study, 20 native English speakers, 20 native Mandarin speakers, and 20 native Korean speakers participated in four experiments: (1) a production experiment, in which they were recorded reading the answers to questions, (2) a perception experiment, in which they were asked to determine which word in a recording was the last prominent word, (3) an understanding experiment, in which they were asked whether the answers in recorded question-answer pairs had context-appropriate prosody, and (4) an accent placement experiment, in which they were asked which word they would make prominent in a particular context. Finally, a new group of native English speakers listened to utterances produced in the production experiment, and determined whether the prosody of each utterance was appropriate for its context. The results of the five experiments support a novel predictive model for second language prosodic focus marking acquisition. This model holds that both transfer of linguistic features from a learner's native language (L1) and features of their second language (L2) affect learners' acquisition of prosodic focus marking. As a result, the model includes two complementary components: the Transfer Component and the L2 Challenge Component. The Transfer Component predicts that prosodic structures in the L2 will be more easily acquired by language learners that have similar structures in their L1 than those who do not, even if there are differences between the L1 and L2 in how the structures are realized. 
The L2 Challenge Component predicts that for difficult tasks, language learners will rely on widely-applied prosodic patterns, making them more successful at prosodically marking broad focus than narrow focus. However, for easy tasks, language learners will more successfully mark information structures that have a more direct relationship between focus and accent placement.
Acoustic constituents of prosodic typology
NASA Astrophysics Data System (ADS)
Komatsu, Masahiko
Different languages sound different, and a considerable part of this derives from typological differences in prosody. Although such differences are often described in terms of lexical accent types (stress accent, pitch accent, and tone; e.g. English, Japanese, and Chinese respectively) and rhythm types (stress-, syllable-, and mora-timed rhythms; e.g. English, Spanish, and Japanese respectively), it is unclear whether these types are determined in terms of acoustic properties. The thesis intends to provide a potential basis for the description of prosody in terms of acoustics. It argues for the hypothesis that the source component of the source-filter model (acoustic features) approximately corresponds to prosody (linguistic features) through several experimental-phonetic studies. The study consists of four parts. (1) Preliminary experiment: Perceptual language identification tests were performed using English and Japanese speech samples whose frequency spectral information (i.e. non-source component) is heavily reduced. The results indicated that humans can discriminate languages with such signals. (2) Discussion on the linguistic information that the source component contains: This part constitutes the foundation of the argument of the thesis. Perception tests of consonants with the source signal indicated that the source component carries the information on broad categories of phonemes that contributes to the creation of rhythm. (3) Acoustic analysis: The speech samples of Chinese, English, Japanese, and Spanish, differing in prosodic types, were analyzed. These languages showed differences in acoustic characteristics of the source component. (4) Perceptual experiment: A language identification test for the above four languages was performed using the source signal with its acoustic features parameterized. It revealed that humans can discriminate prosodic types solely with the source features and that the discrimination is easier as acoustic information increases.
The series of studies showed the correspondence of the source component to prosodic features. In linguistics, prosodic types have not been discussed purely in terms of acoustics; they are usually related to the function of prosody or phonological units such as phonemes. The present thesis focuses on acoustics and makes a contribution to establishing the crosslinguistic description system of prosody.
Effect of Bilingualism on Lexical Stress Pattern Discrimination in French-Learning Infants
Bijeljac-Babic, Ranka; Serres, Josette; Höhle, Barbara; Nazzi, Thierry
2012-01-01
Monolingual infants start learning the prosodic properties of their native language around 6 to 9 months of age, a fact marked by the development of preferences for predominant prosodic patterns and a decrease in sensitivity to non-native prosodic properties. The present study evaluates the effects of bilingual acquisition on speech perception by exploring how stress pattern perception may differ in French-learning 10-month-olds raised in bilingual as opposed to monolingual environments. Experiment 1 shows that monolinguals can discriminate stress patterns following a long familiarization to one of two patterns, but not after a short familiarization. In Experiment 2, two subgroups of bilingual infants growing up learning both French and another language (varying across infants) in which stress is used lexically were tested under the more difficult short familiarization condition: one with balanced input, and one receiving more input in the language other than French. Discrimination was clearly found for the other-language-dominant subgroup, establishing heightened sensitivity to stress pattern contrasts in these bilinguals as compared to monolinguals. However, the balanced bilinguals' performance was not better than that of monolinguals, establishing an effect of the relative balance of the language input. This pattern of results is compatible with the proposal that sensitivity to prosodic contrasts is maintained or enhanced in a bilingual population compared to a monolingual population in which these contrasts are non-native, provided that this dimension is used in one of the two languages in acquisition, and that infants receive enough input from that language. PMID:22363500
Executives' speech expressiveness: analysis of perceptive and acoustic aspects of vocal dynamics.
Marquezin, Daniela Maria Santos Serrano; Viola, Izabel; Ghirardi, Ana Carolina de Assis Moura; Madureira, Sandra; Ferreira, Léslie Piccolotto
2015-01-01
To analyze speech expressiveness in a group of executives based on perceptive and acoustic aspects of vocal dynamics. Four male subjects participated in the research study (S1, S2, S3, and S4). The assessments included the Kingdomality test to obtain the keywords of communicative attitudes; perceptive-auditory assessment to characterize vocal quality and dynamics, performed by three judges who are speech language pathologists; perceptive-auditory assessment to judge the chosen keywords; speech acoustics to assess prosodic elements (Praat software); and a statistical analysis. According to the perceptive-auditory analysis of vocal dynamics, S1, S2, S3, and S4 did not show vocal alterations and all of them were considered with lowered habitual pitch. S1: pointed out as insecure, nonobjective, nonempathetic, and unconvincing with inappropriate use of pauses that are mainly formed by hesitations; inadequate separation of prosodic groups with breaking of syntagmatic constituents. S2: regular use of pauses for respiratory reload, organization of sentences, and emphasis, which is considered secure, little objective, empathetic, and convincing. S3: pointed out as secure, objective, empathetic, and convincing with regular use of pauses for respiratory reload and organization of sentences and hesitations. S4: the most secure, objective, empathetic, and convincing, with proper use of pauses for respiratory reload, planning, and emphasis; prosodic groups agreed with the statement, without separating the syntagmatic constituents. The speech characteristics and communicative attitudes were highlighted in two subjects in a different manner, in such a way that the slow rate of speech and breaks of the prosodic groups transmitted insecurity, little objectivity, and nonpersuasion.
Do patients with schizophrenia use prosody to encode contrastive discourse status?
Michelas, Amandine; Faget, Catherine; Portes, Cristel; Lienhart, Anne-Sophie; Boyer, Laurent; Lançon, Christophe; Champagne-Lavau, Maud
2014-01-01
Patients with schizophrenia (SZ) often display social cognition disorders, including Theory of Mind (ToM) impairments and communication disruptions. Though language disorders appear to be primarily a disruption of pragmatics, SZ can also experience difficulties at other linguistic levels including the prosodic one. Here, using an interactive paradigm, we showed that SZ individuals did not use prosodic phrasing to encode the contrastive status of discourse referents in French. We used a semi-spontaneous task to elicit noun-adjective pairs in which the noun in the second noun-adjective fragment was identical to the noun in the first fragment (e.g., BONBONS marron “brown candies” vs. BONBONS violets “purple candies”) or could contrast with it (e.g., BOUGIES violettes “purple candles” vs. BONBONS violets “purple candies”). We found that healthy controls parsed the target noun in the second noun-adjective fragment separately from the color adjective, to warn their interlocutor that this noun constituted a contrastive entity (e.g., BOUGIES violettes followed by [BONBONS] [violets]) compared to when it referred to the same object as in the first fragment (e.g., BONBONS marron followed by [BONBONS violets]). On the contrary, SZ individuals did not use prosodic phrasing to encode contrastive status of target nouns. In addition, SZ's difficulties to use prosody of contrast were correlated to their score in a classical ToM task (i.e., the hinting task). Taken together, our data provide evidence that SZ patients exhibit difficulties to prosodically encode discourse statuses and sketch a potential relationship between ToM and the use of linguistic prosody. PMID:25101025
Functional Lateralization of Speech Processing in Adults and Children Who Stutter
Sato, Yutaka; Mori, Koichi; Koizumi, Toshizo; Minagawa-Kawai, Yasuyo; Tanaka, Akihiro; Ozawa, Emi; Wakaba, Yoko; Mazuka, Reiko
2011-01-01
Developmental stuttering is a speech disorder in fluency characterized by repetitions, prolongations, and silent blocks, especially in the initial parts of utterances. Although their symptoms are motor related, people who stutter show abnormal patterns of cerebral hemispheric dominance in both anterior and posterior language areas. It is unknown whether the abnormal functional lateralization in the posterior language area starts during childhood or emerges as a consequence of many years of stuttering. In order to address this issue, we measured the lateralization of hemodynamic responses in the auditory cortex during auditory speech processing in adults and children who stutter, including preschoolers, with near-infrared spectroscopy. We used the analysis–resynthesis technique to prepare two types of stimuli: (i) a phonemic contrast embedded in Japanese spoken words (/itta/ vs. /itte/) and (ii) a prosodic contrast (/itta/ vs. /itta?/). In the baseline blocks, only /itta/ tokens were presented. In phonemic contrast blocks, /itta/ and /itte/ tokens were presented pseudo-randomly, and /itta/ and /itta?/ tokens in prosodic contrast blocks. In adults and children who do not stutter, there was a clear left-hemispheric advantage for the phonemic contrast compared to the prosodic contrast. Adults and children who stutter, however, showed no significant difference between the two stimulus conditions. A subject-by-subject analysis revealed that not a single subject who stutters showed a left advantage in the phonemic contrast over the prosodic contrast condition. These results indicate that the functional lateralization for auditory speech processing is in disarray among those who stutter, even at preschool age. These results shed light on the neural pathophysiology of developmental stuttering. PMID:21687442
The emergence of embedded structure: insights from Kafr Qasem Sign Language
Kastner, Itamar; Meir, Irit; Sandler, Wendy; Dachkovsky, Svetlana
2014-01-01
This paper introduces data from Kafr Qasem Sign Language (KQSL), an as-yet undescribed sign language, and identifies the earliest indications of embedding in this young language. Using semantic and prosodic criteria, we identify predicates that form a constituent with a noun, functionally modifying it. We analyze these structures as instances of embedded predicates, exhibiting what can be regarded as very early stages in the development of subordinate constructions, and argue that these structures may bear directly on questions about the development of embedding and subordination in language in general. Deutscher (2009) argues persuasively that nominalization of a verb is the first step—and the crucial step—toward syntactic embedding. It has also been suggested that prosodic marking may precede syntactic marking of embedding (Mithun, 2009). However, the relevant data from the stage at which embedding first emerges have not previously been available. KQSL might be the missing piece of the puzzle: a language in which a noun can be modified by an additional predicate, forming a proposition within a proposition, sustained entirely by prosodic means. PMID:24917837
Wang, Hsiao-Lan S; Chen, I-Chen; Chiang, Chun-Han; Lai, Ying-Hui; Tsao, Yu
2016-10-01
The current study examined the associations between basic auditory perception, speech prosodic processing, and vocabulary development in Chinese kindergartners, specifically, whether early basic auditory perception may be related to linguistic prosodic processing in Chinese Mandarin vocabulary acquisition. A series of language, auditory, and linguistic prosodic tests were given to 100 preschool children who had not yet learned how to read Chinese characters. The results suggested that lexical tone sensitivity and intonation production were significantly correlated with children's general vocabulary abilities. In particular, tone awareness was associated with comprehensive language development, whereas intonation production was associated with both comprehensive and expressive language development. Regression analyses revealed that tone sensitivity accounted for 36% of the unique variance in vocabulary development, whereas intonation production accounted for 6% of the variance in vocabulary development. Moreover, auditory frequency discrimination was significantly correlated with lexical tone sensitivity, syllable duration discrimination, and intonation production in Mandarin Chinese. Also it provided significant contributions to tone sensitivity and intonation production. Auditory frequency discrimination may indirectly affect early vocabulary development through Chinese speech prosody. © The Author(s) 2016.
Poliva, Oren
2016-01-01
The auditory cortex communicates with the frontal lobe via the middle temporal gyrus (auditory ventral stream; AVS) or the inferior parietal lobule (auditory dorsal stream; ADS). Whereas the AVS is ascribed only with sound recognition, the ADS is ascribed with sound localization, voice detection, prosodic perception/production, lip-speech integration, phoneme discrimination, articulation, repetition, phonological long-term memory and working memory. Previously, I interpreted the juxtaposition of sound localization, voice detection, audio-visual integration and prosodic analysis, as evidence that the behavioral precursor to human speech is the exchange of contact calls in non-human primates. Herein, I interpret the remaining ADS functions as evidence of additional stages in language evolution. According to this model, the role of the ADS in vocal control enabled early Homo (Hominans) to name objects using monosyllabic calls, and allowed children to learn their parents' calls by imitating their lip movements. Initially, the calls were forgotten quickly but gradually were remembered for longer periods. Once the representations of the calls became permanent, mimicry was limited to infancy, and older individuals encoded in the ADS a lexicon for the names of objects (phonological lexicon). Consequently, sound recognition in the AVS was sufficient for activating the phonological representations in the ADS and mimicry became independent of lip-reading. Later, by developing inhibitory connections between acoustic-syllabic representations in the AVS and phonological representations of subsequent syllables in the ADS, Hominans became capable of concatenating the monosyllabic calls for repeating polysyllabic words (i.e., developed working memory). Finally, due to strengthening of connections between phonological representations in the ADS, Hominans became capable of encoding several syllables as a single representation (chunking). Consequently, Hominans began vocalizing and mimicking/rehearsing lists of words (sentences). PMID:27445676
Pettinato, Michèle; Clerck, Ilke De; Verhoeven, Jo; Gillis, Steven
This longitudinal study examined the effect of emerging vocabulary production on the ability to produce the phonetic cues to prosodic prominence in babbled and lexical disyllables of infants with cochlear implants (CI) and normally hearing (NH) infants. Current research on typical language acquisition emphasizes the importance of vocabulary development for phonological and phonetic acquisition. Children with CI experience significant difficulties with the perception and production of prosody, and the role of possible top-down effects is, therefore, particularly relevant for this population. Isolated disyllabic babble and first words were identified and segmented in longitudinal audio-video recordings and transcriptions for nine NH infants and nine infants with CI interacting with their parents. Monthly recordings were included from the onset of babbling until children had reached a cumulative vocabulary of 200 words. Three cues to prosodic prominence, fundamental frequency (f0), intensity, and duration, were measured in the vocalic portions of stand-alone disyllables. To represent the degree of prosodic differentiation between two syllables in an utterance, the raw values for intensity and duration were transformed to ratios, and for f0, a measure of the perceptual distance in semitones was derived. The degree of prosodic differentiation for disyllabic babble and words for each cue was compared between groups. In addition, group and individual tendencies on the types of stress patterns for babble and words were also examined. The CI group had overall smaller pitch and intensity distances than the NH group. For the NH group, words had greater pitch and intensity distances than babbled disyllables. Especially for pitch distance, this was accompanied by a shift toward a more clearly expressed stress pattern that reflected the influence of the ambient language. For the CI group, the same expansion in words did not take place for pitch. For intensity, the CI group gave evidence of some increase of prosodic differentiation. The results for the duration measure showed evidence of utterance final lengthening in both groups. In words, the CI group significantly reduced durational differences between syllables so that a more even-timed, less differentiated pattern emerged. The onset of vocabulary production did not have the same facilitatory effect for the CI infants on the production of phonetic cues for prosody, especially for pitch. It was argued that the results for duration may reflect greater articulatory difficulties in words for the CI group than the NH group. It was suggested that the lack of clear top-down effects of the vocabulary in the CI group may be because of a lag in development caused by an initial lack of auditory stimulation, possibly compounded by the absence of auditory feedback during the babble phase.
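The abstract above describes expressing prosodic differentiation between two syllables as ratios (intensity, duration) and as a semitone distance (f0). A minimal sketch of those two transformations follows, using the standard semitone definition of 12 times the base-2 logarithm of the frequency ratio. The function names, the direction of each ratio (first syllable over second), and the sign convention are assumptions for illustration; the abstract does not state which conventions the authors used.

```python
import math


def semitone_distance(f0_first_hz: float, f0_second_hz: float) -> float:
    """Signed pitch distance in semitones from the first to the second syllable.

    Uses the standard definition 12 * log2(f2 / f1). The sign convention
    (positive when the second syllable is higher) is an assumption.
    """
    if f0_first_hz <= 0 or f0_second_hz <= 0:
        raise ValueError("f0 values must be positive")
    return 12.0 * math.log2(f0_second_hz / f0_first_hz)


def cue_ratio(first: float, second: float) -> float:
    """Ratio of a cue value (e.g. duration) in the first syllable to the second.

    Hypothetical helper: the direction of the ratio is an assumption.
    """
    if second == 0:
        raise ValueError("second value must be non-zero")
    return first / second
```

A ratio near 1 or a semitone distance near 0 would indicate little differentiation between the two syllables on that cue.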
Jürgens, Rebecca; Fischer, Julia; Schacht, Annekathrin
2018-01-01
Emotional expressions provide strong signals in social interactions and can function as emotion inducers in a perceiver. Although speech provides one of the most important channels for human communication, its physiological correlates, such as activations of the autonomous nervous system (ANS) while listening to spoken utterances, have received far less attention than in other domains of emotion processing. Our study aimed at filling this gap by investigating autonomic activation in response to spoken utterances that were embedded into larger semantic contexts. Emotional salience was manipulated by providing information on alleged speaker similarity. We compared these autonomic responses to activations triggered by affective sounds, such as exploding bombs, and applause. These sounds had been rated and validated as being either positive, negative, or neutral. As physiological markers of ANS activity, we recorded skin conductance responses (SCRs) and changes of pupil size while participants classified both prosodic and sound stimuli according to their hedonic valence. As expected, affective sounds elicited increased arousal in the receiver, as reflected in increased SCR and pupil size. In contrast, SCRs to angry and joyful prosodic expressions did not differ from responses to neutral ones. Pupil size, however, was modulated by affective prosodic utterances, with increased dilations for angry and joyful compared to neutral prosody, although the similarity manipulation had no effect. These results indicate that cues provided by emotional prosody in spoken semantically neutral utterances might be too subtle to trigger SCR, although variation in pupil size indicated the salience of stimulus variation. Our findings further demonstrate a functional dissociation between pupil dilation and skin conductance that presumably originates from their differential innervation. PMID:29541045
NASA Astrophysics Data System (ADS)
Roth, Wolff-Michael; Tobin, Kenneth
2010-12-01
This ethnographic study of teaching and learning in urban high school science classes investigates the ways in which teachers and students talk, gesture, and use space and time in interaction rituals. In situations where teachers coteach as a means of learning to teach in inner-city schools, successful teacher-teacher collaborations are characterized by prosodic expressions that converge over time and adapt to match the prosodic parameters of students' talk. In these situations our ethnographic data provide evidence of solidarity and positive emotions among the teachers and also between students and teachers. Unsuccessful collaborations are associated with considerable differences in pitch between consecutive speakers participating in turns-at-talk, these being related to the production of negative emotions and conflicts at longer time scales. Situational conflicts are co-expressed by increases in pitch levels, speech intensities, and speech rates; and conflict resolution is accelerated by the coordination of pitch levels. Our study therefore suggests that prosodic alignment and misalignment are resources that are pragmatically deployed to manage face-to-face interactions that have solidarity and conflict as their longer-term outcomes.
Kakouros, Sofoklis; Räsänen, Okko
2016-09-01
Numerous studies have examined the acoustic correlates of sentential stress and its underlying linguistic functionality. However, the mechanism that connects stress cues to the listener's attentional processing has remained unclear. Also, the learnability versus innateness of stress perception has not been widely discussed. In this work, we introduce a novel perspective to the study of sentential stress and put forward the hypothesis that perceived sentence stress in speech is related to the unpredictability of prosodic features, thereby capturing the attention of the listener. As predictability is based on the statistical structure of the speech input, the hypothesis also suggests that stress perception is a result of general statistical learning mechanisms. To study this idea, computational simulations are performed where temporal prosodic trajectories are modeled with an n-gram model. Probabilities of the feature trajectories are subsequently evaluated on a set of novel utterances and compared to human perception of stress. The results show that the low-probability regions of F0 and energy trajectories are strongly correlated with stress perception, giving support to the idea that attention and unpredictability of sensory stimulus are mutually connected. Copyright © 2015 Cognitive Science Society, Inc.
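The abstract above models prosodic trajectories with an n-gram model and treats low-probability regions as candidates for perceived stress. The sketch below illustrates the general idea on a quantized feature track such as F0. Everything specific is an assumption made for illustration: the bigram order, the uniform quantization, the add-one smoothing, and all function names. The abstract does not give the authors' model order, features, or smoothing scheme.

```python
import math
from collections import Counter


def quantize(values, n_bins, lo, hi):
    """Map continuous feature values to integer bins in [0, n_bins - 1]."""
    width = (hi - lo) / n_bins
    return [min(n_bins - 1, max(0, int((v - lo) / width))) for v in values]


def train_bigram(sequences, n_bins):
    """Return an add-one smoothed estimate of P(current bin | previous bin)."""
    pair_counts = Counter()
    context_counts = Counter()
    for seq in sequences:
        for prev, cur in zip(seq, seq[1:]):
            pair_counts[(prev, cur)] += 1
            context_counts[prev] += 1

    def prob(prev, cur):
        # Add-one smoothing so unseen transitions keep a non-zero probability.
        return (pair_counts[(prev, cur)] + 1) / (context_counts[prev] + n_bins)

    return prob


def surprisal(seq, prob):
    """Surprisal (-log2 p) of every frame after the first, given its predecessor."""
    return [-math.log2(prob(prev, cur)) for prev, cur in zip(seq, seq[1:])]
```

Under these assumptions, frames with the highest surprisal on a held-out utterance are the least predictable points of the trajectory, which is the quantity the abstract relates to perceived stress.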
Acoustics of contrastive prosody in children
NASA Astrophysics Data System (ADS)
Patel, Rupal; Piel, Jordan; Grigos, Maria
2005-04-01
Empirical data on the acoustics of prosodic control in children is limited, particularly for linguistically contrastive tasks. Twelve children aged 4, 7, and 11 years were asked to produce two utterances “Show Bob a bot” (voiced consonants) and “Show Pop a pot” (voiceless consonants) 10 times each with emphasis placed on the second word (Bob/Pop) and 10 times with emphasis placed on the last word (bot/pot). A total of 40 utterances were analyzed per child. The following acoustic measures were obtained for each word within each utterance: average fundamental frequency (f0), peak f0, average intensity, peak intensity, and duration. Preliminary results suggest that 4 year olds are unable to modulate prosodic cues to signal the linguistic contrast. The 7 year olds, however, not only signaled the appropriate stress location, but did so with the most contrastive differences in f0, intensity, and duration, of all age groups. Prosodic differences between stressed and unstressed words were more pronounced for the utterance with voiced consonants. These findings suggest that the acoustics of linguistic prosody begin to differentiate between age 4 and 7 and may be highly influenced by changes in physiological control and flexibility that may also affect segmental features.
The emergence of complexity in prosody and syntax
Meir, Irit; Dachkovsky, Svetlana; Padden, Carol; Aronoff, Mark
2011-01-01
The relation between prosody and syntax is investigated here by tracing the emergence of each in a new language, Al-Sayyid Bedouin Sign Language. We analyze the structure of narratives of four signers of this language: two older second generation signers, and two about 15 years younger. We find that younger signers produce prosodic cues to dependency between semantically related constituents, e.g., the two clauses of conditionals, revealing a type and degree of complexity in their language that is not frequent in that of the older pair. In these younger signers, several rhythmic and (facial) intonational cues are aligned at constituent boundaries, indicating the emergence of a grammatical system. There are no overt syntactic markers (such as complementizers) to relate clauses; prosody is the only clue. But this prosodic complexity is matched by syntactic complexity inside propositions in the younger signers, who are more likely to use pronouns as abstract grammatical markers of arguments, and to combine predicates with their arguments within a constituent. As the prosodic means emerge for identifying constituent types and signaling dependency relations between them, the constituents themselves become increasingly complex. Finally, our study shows that the emergence of grammatical complexity is gradual. PMID:23087486
Facial and prosodic emotion recognition in social anxiety disorder.
Tseng, Huai-Hsuan; Huang, Yu-Lien; Chen, Jian-Ting; Liang, Kuei-Yu; Lin, Chao-Cheng; Chen, Sue-Huei
2017-07-01
Patients with social anxiety disorder (SAD) have a cognitive preference to negatively evaluate emotional information. In particular, the preferential biases in prosodic emotion recognition in SAD have been much less explored. The present study aims to investigate whether SAD patients retain negative evaluation biases across visual and auditory modalities when given sufficient response time to recognise emotions. Thirty-one SAD patients and 31 age- and gender-matched healthy participants completed a culturally suitable non-verbal emotion recognition task and received clinical assessments for social anxiety and depressive symptoms. A repeated measures analysis of variance was conducted to examine group differences in emotion recognition. Compared to healthy participants, SAD patients were significantly less accurate at recognising facial and prosodic emotions, and spent more time on emotion recognition. The differences were mainly driven by the lower accuracy and longer reaction times for recognising fearful emotions in SAD patients. Within the SAD patients, lower accuracy of sad face recognition was associated with higher severity of depressive and social anxiety symptoms, particularly with avoidance symptoms. These findings may represent a cross-modality pattern of avoidance in the later stage of identifying negative emotions in SAD. This pattern may be linked to clinical symptom severity.
Signature of prosody in tonal realization: Evidence from Standard Chinese
NASA Astrophysics Data System (ADS)
Chen, Yiya
2004-05-01
It is by now widely accepted that the articulation of speech is influenced by the prosodic structure into which the utterance is organized. Furthermore, the effect of prosody on F0 realization has been shown to be mainly phonological [Beckman and Pierrehumbert (1986); Selkirk and Shen (1990)]. This paper presents data from the F0 realizations of lexical tones in Standard Chinese and shows that prosodic factors may influence the articulation of a lexical tone and induce phonetic variations in its surface F0 contours, similar to the phonetic effect of prosody on segment articulation [de Jong (1995); Keating and Fougeron (1997)]. Data were elicited from four native speakers of Standard Chinese producing all four lexical tones in different tonal contexts and under various focus conditions (i.e., under focus, no focus, and post focus), with three renditions for each condition. The observed F0 variations are argued to be best analyzed as resulting from prosodically driven differences in the phonetic implementation of the lexical tonal targets, which in turn is induced by pragmatically driven differences in how distinctive an underlying tonal target should be realized. Implications of this study on the phonetic implementation of phonological tonal targets will also be discussed.
On the role of attention for the processing of emotions in speech: sex differences revisited.
Schirmer, Annett; Kotz, Sonja A; Friederici, Angela D
2005-08-01
In a previous cross-modal priming study [A. Schirmer, A.S. Kotz, A.D. Friederici, Sex differentiates the role of emotional prosody during word processing, Cogn. Brain Res. 14 (2002) 228-233.], we found that women integrated emotional prosody and word valence earlier than men. Both sexes showed a smaller N400 in the event-related potential to emotional words when these words were preceded by a sentence with congruous compared to incongruous emotional prosody. However, women showed this effect with a 200-ms interval between prime sentence and target word whereas men showed the effect with a 750-ms interval. The present study was designed to determine whether these sex differences prevail when attention is directed towards the emotional content of prosody and word meaning. To this end, we presented the same prime sentences and target words as in our previous study. Sentences were spoken with happy or sad prosody and followed by a congruous or incongruous emotional word or pseudoword. The interval between sentence offset and target onset was 200 ms. In addition to performing a lexical decision, participants were asked to decide whether or not a word matched the emotional prosody of the preceding sentence. The combined lexical and congruence judgment failed to reveal differences in emotional-prosodic priming between men and women. Both sexes showed smaller N400 amplitudes to emotionally congruent compared to incongruent words. This suggests that the presence of sex differences in emotional-prosodic priming depends on whether or not participants are instructed to take emotional prosody into account.
Effects of prosodically modulated sub-phonetic variation on lexical competition.
Salverda, Anne Pier; Dahan, Delphine; Tanenhaus, Michael K; Crosswhite, Katherine; Masharov, Mikhail; McDonough, Joyce
2007-11-01
Eye movements were monitored as participants followed spoken instructions to manipulate one of four objects pictured on a computer screen. Target words occurred in utterance-medial (e.g., Put the cap next to the square) or utterance-final position (e.g., Now click on the cap). Displays consisted of the target picture (e.g., a cap), a monosyllabic competitor picture (e.g., a cat), a polysyllabic competitor picture (e.g., a captain) and a distractor (e.g., a beaker). The relative proportion of fixations to the two types of competitor pictures changed as a function of the position of the target word in the utterance, demonstrating that lexical competition is modulated by prosodically conditioned phonetic variation.
Voice Quality Modelling for Expressive Speech Synthesis
Socoró, Joan Claudi
2014-01-01
This paper presents the perceptual experiments that were carried out in order to validate the methodology of transforming expressive speech styles using voice quality (VoQ) parameters modelling, along with the well-known prosody (F0, duration, and energy), from a neutral style into a number of expressive ones. The main goal was to validate the usefulness of VoQ in the enhancement of expressive synthetic speech in terms of speech quality and style identification. A harmonic plus noise model (HNM) was used to modify VoQ and prosodic parameters that were extracted from an expressive speech corpus. Perception test results indicated the improvement of obtained expressive speech styles using VoQ modelling along with prosodic characteristics. PMID:24587738
Impact of Acute Sleep Deprivation on Sarcasm Detection
Deliens, Gaétane; Stercq, Fanny; Mary, Alison; Slama, Hichem; Cleeremans, Axel; Peigneux, Philippe; Kissine, Mikhail
2015-01-01
There is growing evidence that sleep plays a pivotal role on health, cognition and emotional regulation. However, the interplay between sleep and social cognition remains an uncharted research area. In particular, little is known about the impact of sleep deprivation on sarcasm detection, an ability which, once altered, may hamper everyday social interactions. The aim of this study is to determine whether sleep-deprived participants are as able as sleep-rested participants to adopt another perspective in gauging sarcastic statements. At 9am, after a whole night of sleep (n = 15) or a sleep deprivation night (n = 15), participants had to read the description of an event happening to a group of friends. An ambiguous voicemail message left by one of the friends on another's phone was then presented, and participants had to decide whether the recipient would perceive the message as sincere or as sarcastic. Messages were uttered with a neutral intonation and were either: (1) sarcastic from both the participant’s and the addressee’s perspectives (i.e. both had access to the relevant background knowledge to gauge the message as sarcastic), (2) sarcastic from the participant’s but not from the addressee’s perspective (i.e. the addressee lacked context knowledge to detect sarcasm) or (3) sincere. A fourth category consisted in messages sarcastic from both the participant’s and from the addressee’s perspective, uttered with a sarcastic tone. Although sleep-deprived participants were as accurate as sleep-rested participants in interpreting the voice message, they were also slower. Blunted reaction time was not fully explained by generalized cognitive slowing after sleep deprivation; rather, it could reflect a compensatory mechanism supporting normative accuracy level in sarcasm understanding. Introducing prosodic cues compensated for increased processing difficulties in sarcasm detection after sleep deprivation. Our findings support the hypothesis that sleep deprivation might damage the flow of social interactions by slowing perspective-taking processes. PMID:26535906
van Rijn, Sophie; Aleman, André; van Diessen, Eric; Berckmoes, Celine; Vingerhoets, Guy; Kahn, René S
2005-06-01
Emotional signals in spoken language can be conveyed by semantic as well as prosodic cues. We investigated the role of the right-hemisphere fronto-parietal operculum, a somatosensory area where the lips, tongue and jaw are represented, in the detection of emotion in prosody vs. semantics. A total of 14 healthy volunteers participated in the present experiment, which involved transcranial magnetic stimulation (TMS) in combination with frameless stereotaxy. As predicted, compared with sham stimulation, TMS over the right fronto-parietal operculum differentially affected the reaction times for detection of emotional prosody vs. emotional semantics, showing that there is a dissociation at a neuroanatomical level. Detection of withdrawal emotions (fear and sadness) in prosody was delayed significantly by TMS. No effects of TMS were observed for approach emotions (happiness and anger). We propose that the right fronto-parietal operculum is not globally involved in emotion evaluation, but sensitive to specific forms of emotional discrimination and emotion types.
Torppa, Ritva; Faulkner, Andrew; Huotilainen, Minna; Järvikivi, Juhani; Lipsanen, Jari; Laasonen, Marja; Vainio, Martti
2014-03-01
This study examined prosodic perception in early-implanted children in relation to auditory discrimination, auditory working memory, and exposure to music. Word and sentence stress perception, discrimination of fundamental frequency (F0), intensity and duration, and forward digit span were measured twice over approximately 16 months. Musical activities were assessed by questionnaire. Participants were twenty-one early-implanted and age-matched normal-hearing (NH) children (4-13 years). Children with cochlear implants (CIs) exposed to music performed better than others in stress perception and F0 discrimination. Only this subgroup of implanted children improved with age in word stress perception and intensity discrimination, and improved over time in digit span. Prosodic perception, F0 discrimination and forward digit span in implanted children exposed to music were equivalent to those of the NH group, but other implanted children performed more poorly. For children with CIs, word stress perception was linked to digit span and intensity discrimination; sentence stress perception was additionally linked to F0 discrimination. Prosodic perception in children with CIs is linked to auditory working memory and aspects of auditory discrimination. Engagement in music was linked to better performance across a range of measures, suggesting that music is a valuable tool in the rehabilitation of implanted children.
Do Persian Native Speakers Prosodically Mark Wh-in-situ Questions?
Shiamizadeh, Zohreh; Caspers, Johanneke; Schiller, Niels O
2018-02-01
It has been shown that prosody contributes to the contrast between declarativity and interrogativity, notably in interrogative utterances lacking lexico-syntactic features of interrogativity. Accordingly, it may be proposed that prosody plays a role in marking wh-in-situ questions in which the interrogativity feature (the wh-phrase) does not move to sentence-initial position, as, for example, in Persian. This paper examines whether prosody distinguishes Persian wh-in-situ questions from declaratives in the absence of the interrogativity feature in the sentence-initial position. To answer this question, a production experiment was designed in which wh-questions and declaratives were elicited from Persian native speakers. On the basis of the results of previous studies, we hypothesize that prosodic features mark wh-in-situ questions as opposed to declaratives at both the local (pre- and post-wh part) and global level (complete sentence). The results of the current study confirm our hypothesis that prosodic correlates mark the pre-wh part as well as the complete sentence in wh-in-situ questions. The results support theoretical concepts such as the frequency code, the universal dichotomous association between relaxation and declarativity on the one hand and tension and interrogativity on the other, the relation between prosody and pragmatics, and the relation between prosody and encoding and decoding of sentence type.
Li, X; Yang, Y; Ren, G
2009-06-16
Language is often perceived together with visual information. Recent experimental evidence indicates that, during spoken language comprehension, the brain can immediately integrate visual information with semantic or syntactic information from speech. Here we used the mismatch negativity to further investigate whether prosodic information from speech can be immediately integrated into a visual scene context, and especially the time course and automaticity of this integration process. Sixteen Chinese native speakers participated in the study. The materials included Chinese spoken sentences and picture pairs. In the audiovisual situation, relative to the concomitant pictures, the spoken sentence was appropriately accented in the standard stimuli, but inappropriately accented in the two kinds of deviant stimuli. In the purely auditory situation, the spoken sentences were presented without pictures. It was found that the deviants evoked mismatch responses in both audiovisual and purely auditory situations; the mismatch negativity in the purely auditory situation peaked at the same time as, but was weaker than, that evoked by the same deviant speech sounds in the audiovisual situation. This pattern of results suggests immediate integration of prosodic information from speech and visual information from pictures in the absence of focused attention.
Processing prosodic structure by adults with language-based learning disability.
Bahl, Megha; Plante, Elena; Gerken, LouAnn
2009-01-01
Two experiments investigated the ability of adults with a history of language-based learning disability (hLLD) and their normal language (NL) peers to learn prosodic patterns of a novel language. Participants were exposed to stimuli from an artificial language and tested on items that required generalization of the stress patterns and the hierarchical principles of stress assignment that could be inferred from the input. In Study 1, the NL group successfully generalized the patterns of stress heard during familiarization, but failed to show generalization of the hierarchical principles. The hLLD group performed at chance for both types of generalization items. In Study 2, the intensity of stress elements was increased. The performance of the NL group improved, whereas the hLLD group's performance decreased on both types of generalization items. The results indicate that NL adults are able to successfully abstract the complex hierarchical rules of stress if the prosodic cues are made sufficiently salient, but this same task is difficult for adults with hLLD. The reader will be able to understand: (1) the difference in the ability of hLLD and NL adults to process stress assignment in an implicit learning context and (2) that typical adults can abstract complex hierarchical rules of stress assignment when provided with strong cues.
Räsänen, Okko; Kakouros, Sofoklis; Soderstrom, Melanie
2018-06-06
The exaggerated intonation and special rhythmic properties of infant-directed speech (IDS) have been hypothesized to attract infants' attention to the speech stream. However, there has been little work actually connecting the properties of IDS to models of attentional processing or perceptual learning. A number of such attention models suggest that surprising or novel perceptual inputs attract attention, where novelty can be operationalized as the statistical (un)predictability of the stimulus in the given context. Since prosodic patterns such as F0 contours are accessible to young infants, who are also known to be adept statistical learners, the present paper investigates the hypothesis that F0 contours in IDS are less predictable than those in adult-directed speech (ADS), given previous exposure to both speaking styles, thereby potentially tapping into basic attentional mechanisms of the listeners in the same way that relative probabilities of other linguistic patterns are known to modulate attentional processing in infants and adults. Computational modeling analyses with naturalistic IDS and ADS speech from matched speakers and contexts show that IDS intonation has lower overall temporal predictability even when the F0 contours of both speaking styles are normalized to have equal means and variances. A closer analysis reveals that IDS intonation tends to be less predictable at the end of short utterances, whereas ADS exhibits more stable average predictability patterns across the full extent of the utterances. The difference between IDS and ADS persists even when the proportion of IDS and ADS exposure is varied substantially, simulating different relative amounts of IDS heard in different family and cultural environments. Exposure to IDS is also found to be more efficient for predicting ADS intonation contours in new utterances than exposure to an equal amount of ADS speech. This indicates that the more variable prosodic contours of IDS also generalize to ADS, and may therefore enhance prosodic learning in infancy. Overall, the study suggests that one reason behind infant preference for IDS could be its higher information value at the prosodic level, as measured by the amount of surprisal in the F0 contours. This provides the first formal link between the properties of IDS and the models of attentional processing and statistical learning in the brain. However, this finding does not rule out the possibility that other differences between IDS and ADS also play a role. Copyright © 2018 Elsevier B.V. All rights reserved.
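To make the notion of "surprisal in the F0 contours" concrete, the following is a minimal sketch and not the model used in the study above. It assumes F0 values are quantized into semitone bins and that predictability is estimated with an add-alpha smoothed bigram model; the function names, bin width, reference frequency and smoothing constant are all hypothetical choices made for illustration.

```python
import math
from collections import Counter

def quantize(f0_hz, ref_hz=100.0, step_semitones=2.0):
    """Map F0 values in Hz to integer bins of `step_semitones` relative to ref_hz."""
    return [round(12.0 * math.log2(f / ref_hz) / step_semitones) for f in f0_hz]

def train_bigram(sequences):
    """Count bin-to-bin transitions over a list of quantized contours."""
    pair_counts, context_counts, vocab = Counter(), Counter(), set()
    for seq in sequences:
        vocab.update(seq)
        for prev, cur in zip(seq, seq[1:]):
            pair_counts[(prev, cur)] += 1
            context_counts[prev] += 1
    return pair_counts, context_counts, vocab

def mean_surprisal(seq, model, alpha=1.0):
    """Average surprisal (bits per transition) of a contour under the bigram model.

    Add-alpha smoothing is an illustrative assumption; the vocabulary size is
    taken over the training bins plus any new bins in `seq`.
    """
    pair_counts, context_counts, vocab = model
    v = len(vocab | set(seq))
    pairs = list(zip(seq, seq[1:]))
    if not pairs:
        return 0.0
    total = 0.0
    for prev, cur in pairs:
        p = (pair_counts[(prev, cur)] + alpha) / (context_counts[prev] + alpha * v)
        total += -math.log2(p)
    return total / len(pairs)

# Toy exposure data: mostly flat contours (hypothetical, not real speech).
model = train_bigram([[0, 0, 0, 0, 0], [0, 0, 1, 0, 0]])
flat = mean_surprisal([0, 0, 0, 0], model)
varied = mean_surprisal([0, 3, -2, 4], model)
```

Under this toy exposure, the flat contour receives a lower mean surprisal than the widely varying one, which is the sense in which a more variable contour carries more information per frame.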
Effects of prosodically-modulated sub-phonetic variation on lexical competition
Salverda, Anne Pier; Dahan, Delphine; Tanenhaus, Michael K.; Crosswhite, Katherine; Masharov, Mikhail; McDonough, Joyce
2007-01-01
Eye movements were monitored as participants followed spoken instructions to manipulate one of four objects pictured on a computer screen. Target words occurred in utterance-medial (e.g., Put the cap next to the square) or utterance-final position (e.g., Now click on the cap). Displays consisted of the target picture (e.g., a cap), a monosyllabic competitor picture (e.g., a cat), a polysyllabic competitor picture (e.g., a captain) and a distractor (e.g., a beaker). The relative proportion of fixations to the two types of competitor pictures changed as a function of the position of the target word in the utterance, demonstrating that lexical competition is modulated by prosodically-conditioned phonetic variation. PMID:17141751
The strategic use of noise in pragmatic reasoning.
Bergen, Leon; Goodman, Noah D
2015-04-01
We combine two recent probabilistic approaches to natural language understanding, exploring the formal pragmatics of communication on a noisy channel. We first extend a model of rational communication between a speaker and listener, to allow for the possibility that messages are corrupted by noise. In this model, common knowledge of a noisy channel leads to the use and correct understanding of sentence fragments. A further extension of the model, which allows the speaker to intentionally reduce the noise rate on a word, is used to model prosodic emphasis. We show that the model derives several well-known changes in meaning associated with prosodic emphasis. Our results show that nominal amounts of actual noise can be leveraged for communicative purposes. Copyright © 2015 Cognitive Science Society, Inc.
Ben-David, Boaz M; Multani, Namita; Shakuf, Vered; Rudzicz, Frank; van Lieshout, Pascal H H M
2016-02-01
Our aim is to explore the complex interplay of prosody (tone of speech) and semantics (verbal content) in the perception of discrete emotions in speech. We implement a novel tool, the Test for Rating of Emotions in Speech. Eighty native English speakers were presented with spoken sentences made of different combinations of 5 discrete emotions (anger, fear, happiness, sadness, and neutral) presented in prosody and semantics. Listeners were asked to rate the sentence as a whole, integrating both speech channels, or to focus on one channel only (prosody or semantics). We observed supremacy of congruency, failure of selective attention, and prosodic dominance. Supremacy of congruency means that a sentence that presents the same emotion in both speech channels was rated highest; failure of selective attention means that listeners were unable to selectively attend to one channel when instructed; and prosodic dominance means that prosodic information plays a larger role than semantics in processing emotional speech. Emotional prosody and semantics are separate but not separable channels, and it is difficult to perceive one without the influence of the other. Our findings indicate that the Test for Rating of Emotions in Speech can reveal specific aspects in the processing of emotional speech and may in the future prove useful for understanding emotion-processing deficits in individuals with pathologies.
Panday, Seema; Kathard, Harsha; Pillay, Mershen; Govender, Cyril
2009-01-01
The aim of this investigation was to determine which of 58 preselected Zulu words developed by Panday et al. (2007) could be used for Speech Reception Threshold (SRT) testing. To realize this aim, the homogeneity of audibility of 58 bisyllabic Zulu low tone verbs was measured, followed by an analysis of the prosodic features of the selected words. The words were digitally recorded by a Zulu first language male speaker and presented at 6 intensity levels to 30 Zulu first language speakers (18-25 years, mean age of 21.5 years), whose hearing was normal. Homogeneity of audibility was determined by employing logistic regression analysis. Twenty-eight words met the criterion of homogeneity of audibility, as evidenced by a mean slope at the 50% point of 5.98%/dB. The prosodic features of the twenty-eight words were further analyzed using a computerized speech laboratory system. The findings confirmed that the pitch contours of the words followed the prosodic pattern apparent within Zulu linguistic structure. Eighty-nine percent of the Zulu verbs were found to have a difference in the pitch pattern between the two syllables, i.e. the first syllable was low in pitch, while the second syllable was high in pitch. It emerged that the twenty-eight words could be used for establishing SRT within a normal hearing Zulu speaking population. Further research within clinical populations is recommended.
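As a worked illustration of how a slope in %/dB relates to a logistic regression fit, the sketch below assumes a standard logistic psychometric function p(x) = 1 / (1 + exp(-(b0 + b1*x))), whose derivative at the 50% point is b1/4. The coefficient values are hypothetical: the slope coefficient is back-computed from the 5.98%/dB figure reported above (read here as the slope at the 50% point), and the intercept is invented for the example; neither is taken from the study's data.

```python
import math

def logistic(level_db, intercept, slope):
    """Probability of correct recognition at a presentation level in dB."""
    return 1.0 / (1.0 + math.exp(-(intercept + slope * level_db)))

def threshold_50(intercept, slope):
    """Level in dB at which the fitted function crosses 50% correct."""
    return -intercept / slope

def slope_at_50_percent_per_db(slope):
    """Steepness at the 50% point, in percentage points per dB.

    For a logistic function dp/dx = slope * p * (1 - p), which equals
    slope / 4 at p = 0.5.
    """
    return 100.0 * slope / 4.0

# Hypothetical coefficients for illustration only.
b1 = 0.2392           # per dB; chosen so that 100 * b1 / 4 = 5.98 %/dB
b0 = -20.0 * b1       # places the 50% point at 20 dB (invented value)

srt_db = threshold_50(b0, b1)
steepness = slope_at_50_percent_per_db(b1)
```

A word list is usually considered more homogeneous when the per-word thresholds and slopes computed this way fall within a narrow range of one another.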
Chen, Qingrong; Zhang, Jingjing; Xu, Xiaodong; Scheepers, Christoph; Yang, Yiming; Tanenhaus, Michael K
2016-09-01
In an ERP study, classic Chinese poems with a well-known rhyme scheme were used to generate an expectation of a rhyme in the absence of an expectation for a specific character. Critical characters were either consistent or inconsistent with the expected rhyme scheme and semantically congruent or incongruent with the content of the poem. These stimuli allowed us to examine whether a top-down rhyme scheme expectation would affect relatively early components of the ERP associated with character-to-sound mapping (P200) and lexically-mediated semantic processing (N400). The ERP data revealed that rhyme scheme congruence, but not semantic congruence, modulated the P200: rhyme-incongruent characters elicited a P200 effect across the head, demonstrating that top-down expectations influence early phonological coding of the character before lexical-semantic processing. Rhyme scheme incongruence also produced a right-lateralized N400-like effect. Moreover, compared to semantically congruous poems, semantically incongruous poems produced a larger N400 response only when the character was consistent with the expected rhyme scheme. The results suggest that top-down prosodic expectations can modulate early phonological processing in visual word recognition, indicating that prosodic expectations might play an important role in silent reading. They also suggest that semantic processing is influenced by general knowledge of text genre. Copyright © 2016 Elsevier B.V. All rights reserved.
Kinematic parameters of signed verbs.
Malaia, Evie; Wilbur, Ronnie B; Milkovic, Marina
2013-10-01
Sign language users recruit physical properties of visual motion to convey linguistic information. Research on American Sign Language (ASL) indicates that signers systematically use kinematic features (e.g., velocity, deceleration) of dominant hand motion for distinguishing specific semantic properties of verb classes in production (Malaia & Wilbur, 2012a) and process these distinctions as part of the phonological structure of these verb classes in comprehension (Malaia, Ranaweera, Wilbur, & Talavage, 2012). These studies are driven by the event visibility hypothesis by Wilbur (2003), who proposed that such use of kinematic features should be universal to sign language (SL) by the grammaticalization of physics and geometry for linguistic purposes. In a prior motion capture study, Malaia and Wilbur (2012a) provided support for the event visibility hypothesis in ASL, but there have been no quantitative data from other SLs to test the generalization to other languages. The authors investigated the kinematic parameters of predicates in Croatian Sign Language (Hrvatskom Znakovnom Jeziku [HZJ]). Kinematic features of verb signs were affected both by event structure of the predicate (semantics) and phrase position within the sentence (prosody). The data demonstrate that kinematic features of motion in HZJ verb signs are recruited to convey morphological and prosodic information. This is the first crosslinguistic motion capture confirmation that specific kinematic properties of articulator motion are grammaticalized in other SLs to express linguistic features.
A Joint Prosodic Origin of Language and Music
Brown, Steven
2017-01-01
Vocal theories of the origin of language rarely make a case for the precursor functions that underlay the evolution of speech. The vocal expression of emotion is unquestionably the best candidate for such a precursor, although most evolutionary models of both language and speech ignore emotion and prosody altogether. I present here a model for a joint prosodic precursor of language and music in which ritualized group-level vocalizations served as the ancestral state. This precursor combined not only affective and intonational aspects of prosody, but also holistic and combinatorial mechanisms of phrase generation. From this common stage, there was a bifurcation to form language and music as separate, though homologous, specializations. This separation of language and music was accompanied by their (re)unification in songs with words. PMID:29163276
Ixpantepec Nieves Mixtec Word Prosody
NASA Astrophysics Data System (ADS)
Carroll, Lucien Serapio
This dissertation presents a phonological description and acoustic analysis of the word prosody of Ixpantepec Nieves Mixtec, which involves both a complex tone system and a default stress system. The analysis of Nieves Mixtec word prosody is complicated by a close association between morphological structure and prosodic structure, and by the interactions between word prosody and phonation type, which has both contrastive and non-contrastive roles in the phonology. I contextualize these systems within the phonology of Nieves Mixtec as a whole, within the literature on other Mixtec varieties, and within the literature on cross-linguistic prosodic typology. The literature on prosodic typology indicates that stress is necessarily defined abstractly, as structured prominence realized differently in each language. Descriptions of stress in other Mixtec varieties widely report default stress on the initial syllable of the canonical bimoraic root, though some descriptions suggest final stress or mobile stress. I first present phonological evidence (from distributional restrictions, phonological processes, and loanword adaptation) that Nieves Mixtec word prosody does involve a stress system, based on trochaic feet aligned to the root. I then present an acoustic study comparing stressed syllables to unstressed syllables, for ten potential acoustic correlates of stress. The results indicate that the acoustic correlates of stress in Nieves Mixtec include segmental duration, intensity and periodicity. Building on analyses of other Mixtec tone systems, I show that the distribution of tone and the tone processes in Nieves Mixtec support an analysis in which morae may bear H, M or L tone, where M tone is underlyingly unspecified, and each morpheme may sponsor a final +H or +L floating tone. Bimoraic roots thus host up to two linked tones and one floating tone, while monomoraic clitics host just one linked tone and one floating tone, and tonal morphemes are limited to a single floating tone. I then present three studies describing the acoustic realization of tone and comparing the realization of tone in different prosodic types. The findings of these studies include a strong directional asymmetry in tonal coarticulation, increased duration at the word or phrase boundary, phonation differences among the tone categories, and F0 differences between the glottalization categories.
"The caterpillar": a novel reading passage for assessment of motor speech disorders.
Patel, Rupal; Connaghan, Kathryn; Franco, Diana; Edsall, Erika; Forgit, Dory; Olsen, Laura; Ramage, Lianna; Tyler, Emily; Russell, Scott
2013-02-01
A review of the salient characteristics of motor speech disorders and common assessment protocols revealed the need for a novel reading passage tailored specifically to differentiate between and among the dysarthrias (DYSs) and apraxia of speech (AOS). "The Caterpillar" passage was designed to provide a contemporary, easily read, contextual speech sample with specific tasks (e.g., prosodic contrasts, words of increasing length and complexity) targeted to inform the assessment of motor speech disorders. Twenty-two adults, 15 with DYS or AOS and 7 healthy controls (HC), were recorded reading "The Caterpillar" passage to demonstrate its utility in examining motor speech performance. Analysis of performance across a subset of segmental and prosodic variables illustrated that "The Caterpillar" passage showed promise for extracting individual profiles of impairment that could augment current assessment protocols and inform treatment planning in motor speech disorders.
Beat gestures help preschoolers recall and comprehend discourse information.
Llanes-Coromina, Judith; Vilà-Giménez, Ingrid; Kushch, Olga; Borràs-Comes, Joan; Prieto, Pilar
2018-08-01
Although the positive effects of iconic gestures on word recall and comprehension by children have been clearly established, less is known about the benefits of beat gestures (rhythmic hand/arm movements produced together with prominent prosody). This study investigated (a) whether beat gestures combined with prosodic information help children recall contrastively focused words as well as information related to those words in a child-directed discourse (Experiment 1) and (b) whether the presence of beat gestures helps children comprehend a narrative discourse (Experiment 2). In Experiment 1, 51 4-year-olds were exposed to a total of three short stories with contrastive words presented in three conditions, namely with prominence in both speech and gesture, prominence in speech only, and nonprominent speech. Results of a recall task showed that (a) children remembered more words when exposed to prominence in both speech and gesture than in either of the other two conditions and that (b) children were more likely to remember information related to those words when the words were associated with beat gestures. In Experiment 2, 55 5- and 6-year-olds were presented with six narratives with target items either produced with prosodic prominence but no beat gestures or produced with both prosodic prominence and beat gestures. Results of a comprehension task demonstrated that stories told with beat gestures were comprehended better by children. Together, these results constitute evidence that beat gestures help preschoolers not only to recall discourse information but also to comprehend it. Copyright © 2018 Elsevier Inc. All rights reserved.
The Therapeutic Effect of SpeechVive on Prosody in Parkinson's Disease
NASA Astrophysics Data System (ADS)
Kiefer, Brianna Rose
It is well known that physiological impairments secondary to Parkinson's Disease (PD) negatively impact speech production. Individuals with PD display vocal, prosodic, resonant, and articulatory abnormalities which reduce communicative effectiveness. Prosody is a broad term which refers to the alterations in pitch, duration, and loudness used by speakers to convey important linguistic and paralinguistic information during speech. Little is known about the prosodic abnormalities associated with PD relative to healthy older adults; however, it is well known that individuals with PD display impairments in their ability to modulate the acoustic cues (pitch, duration, intensity) associated with prosodic inflection in speech. Literature presently lacks sufficient evidence to support treatment paradigms commonly used to address dysprosody in PD. Thus, there is a significant need to develop and investigate potential evidence-based treatment paradigms for dysprosody associated with PD. The present study aimed to examine the potential treatment effects the SpeechVive device has on treating dysprosody in PD. Acoustic recordings were obtained from 15 individuals with PD during a reading task. Participants read the passage at the start of the study and 12 weeks later, after wearing the SpeechVive device for the intervening weeks. Main outcome measures examined productions of contrastive stress, intonation contours, rate, and patterns of pausing. The results revealed that participants increased vocal intensity levels during the production of stressed words and improved standard deviation of pitch during the productions of intonation contours. Lastly, the device was found to improve participants' abilities to pause relative to syntactic boundaries.
Speech Prosody Across Stimulus Types for Individuals with Parkinson's Disease.
K-Y Ma, Joan; Schneider, Christine B; Hoffmann, Rüdiger; Storch, Alexander
2015-01-01
Up to 89% of individuals with Parkinson's disease (PD) experience speech problems over the course of the disease. Speech prosody and intelligibility are two of the most affected areas in hypokinetic dysarthria. However, assessment of these areas can be problematic, as speech prosody and intelligibility may be affected by the type of speech materials employed. This study compared the effects of different types of speech stimulus on speech prosody and intelligibility in PD speakers. Speech prosody and intelligibility of two groups of individuals with varying degrees of dysarthria resulting from PD were compared to those of a group of control speakers using sentence reading, passage reading and monologue. Acoustic analysis, including measures of fundamental frequency (F0), intensity and speech rate, was used to form a prosodic profile for each individual. Speech intelligibility was measured for the speakers with dysarthria using direct magnitude estimation. A difference in F0 variability between the speakers with dysarthria and the control speakers was observed only in the sentence reading task. A difference in average intensity level was observed between speakers with mild dysarthria and the control speakers. Additionally, there were stimulus effects on both intelligibility and prosodic profile. The prosodic profile of PD speakers differed from that of the control speakers in the more structured task, and lower intelligibility was found in the less structured task. This highlights the value of both structured and natural stimuli for evaluating speech production in PD speakers.
Haderlein, Tino; Döllinger, Michael; Matoušek, Václav; Nöth, Elmar
2016-10-01
Automatic voice assessment is often performed using sustained vowels. In contrast, speech analysis of read-out texts can be applied to voice and speech assessment. Automatic speech recognition and prosodic analysis were used to find regression formulae between automatic and perceptual assessment of four voice and four speech criteria. The regression was trained with 21 men and 62 women (average age 49.2 years) and tested with another set of 24 men and 49 women (48.3 years), all suffering from chronic hoarseness. They read the text 'Der Nordwind und die Sonne' ('The North Wind and the Sun'). Five voice and speech therapists evaluated the data on 5-point Likert scales. Ten prosodic and recognition accuracy measures (features) were identified which describe all the examined criteria. Inter-rater correlation within the expert group was between r = 0.63 for the criterion 'match of breath and sense units' and r = 0.87 for the overall voice quality. Human-machine correlation was between r = 0.40 for the match of breath and sense units and r = 0.82 for intelligibility. The perceptual ratings of different criteria were highly correlated with each other. Likewise, the feature sets modeling the criteria were very similar. The automatic method is suitable for assessing chronic hoarseness in general and for subgroups of functional and organic dysphonia. In its current version, it is almost as reliable as a randomly picked rater from a group of voice and speech therapists.
The role of prominence in determining the scope of boundary-related lengthening in Greek.
Katsika, Argyro
2016-03-01
This study aims to examine and account for the scope of the temporal effect of phrase boundaries. Previous research has indicated that there is an interaction between boundary-related lengthening and prominence such that the former extends towards the nearby prominent syllable. However, it is unclear whether this interaction is due to lexical stress and/or phrasal prominence (marked by pitch accent) and how far towards the prominent syllable the effect extends. Here, we use an electromagnetic articulography (EMA) study of Greek to examine the scope of boundary-related lengthening as a function of lexical stress and pitch accent separately. Boundaries are elicited by means of a variety of syntactic constructions. The results show an effect of lexical stress. Phrase-final lengthening affects the articulatory gestures of the phrase-final syllable that are immediately adjacent to the boundary in words with final stress, but is initiated earlier within phrase-final words with non-final stress. Similarly, the articulatory configurations during inter-phrasal pauses reach their point of achievement later in words with final stress than in words with non-final stress. These effects of stress hold regardless of whether the phrase-final word is accented or de-accented. Phrase-initial lengthening, on the other hand, is consistently detected on the phrase-initial constriction, independently of where the stress is within the preceding, phrase-final, word. These results indicate that the lexical aspect of prominence plays a role in determining the scope of boundary-related lengthening in Greek. Based on these results, a gestural account of prosodic boundaries in Greek is proposed in which lexical and phrasal prosody interact in a systematic and coordinated fashion. The cross-linguistic dimensions of this account and its implications for prosodic structure are discussed.
Perception of Intonation in Native and Non-Native Speakers of English.
ERIC Educational Resources Information Center
Berkovits, Rochele
1980-01-01
Indicates that native and nonnative speakers alike can make use of intonation if they explicitly listen for it, although prosodic features are generally ignored when other cues (semantic and pragmatic) are available. (Author/RL)
A matter of emphasis: Linguistic stress habits modulate serial recall.
Taylor, John C; Macken, Bill; Jones, Dylan M
2015-04-01
Models of short-term memory for sequential information rely on item-level, feature-based descriptions to account for errors in serial recall. Transposition errors within alternating similar/dissimilar letter sequences derive from interactions between overlapping features. However, in two experiments, we demonstrated that the characteristics of the sequence are what determine the fates of items, rather than the properties ascribed to the items themselves. Performance in alternating sequences is determined by the way that the sequences themselves induce particular prosodic rehearsal patterns, and not by the nature of the items per se. In a serial recall task, the shapes of the canonical "saw-tooth" serial position curves and transposition error probabilities at successive input-output distances were modulated by subvocal rehearsal strategies, despite all item-based parameters being held constant. We replicated this finding using nonalternating lists, thus demonstrating that transpositions are substantially influenced by prosodic features, such as stress, that emerge during subvocal rehearsal.
Kawahara, Shigeto; Shinya, Takahito
2008-01-01
In previous studies of Japanese intonational phonology, levels of prosodic constituents above the Major Phrase have not received much attention. This paper argues that at least two prosodic levels exist above the Major Phrase in Japanese. Through a detailed investigation of the intonation of gapping and coordination in Japanese, we argue that each syntactic clause projects its own Intonational Phrase, while an entire sentence constitutes one Utterance. We show that the Intonational Phrase is characterized by tonal lowering, creakiness and a pause in final position, as well as a distinctive large initial rise and pitch reset at its beginning. The Utterance defines a domain of declination, and it is signaled by an even larger initial rise, as well as a phrasal H tone at its right edge. Building on our empirical findings, we discuss several implications for the theory of intonational phonology. (c) 2007 S. Karger AG, Basel.
Intonation development from five to thirteen.
Wells, Bill; Peppé, Sue; Goulandris, Nata
2004-11-01
Research undertaken to date suggests that important developments in the understanding and use of intonation may take place after the age of 5;0. The present study aims to provide a more comprehensive account of these developments. A specially designed battery of prosodic tasks was administered to four groups of thirty children, from London (U.K.), with mean ages of 5;6, 8;7, 10;10 and 13;9. The tasks tap comprehension and production of functional aspects of intonation, in four communicative areas: CHUNKING (i.e. prosodic phrasing), AFFECT, INTERACTION and FOCUS. Results indicate that there is considerable variability among children within each age band on most tasks. The ability to produce intonation functionally is largely established in five-year-olds, though some specific functional contrasts are not mastered until C.A. 8;7. Aspects of intonation comprehension continue to develop up to C.A. 10;10, correlating with measures of expressive and receptive language development.
Stress Domain Effects in French Phonology and Phonological Development.
Rose, Yvan; Dos Santos, Christophe
In this paper, we discuss two distinct data sets. The first relates to the so-called allophonic process of closed-syllable laxing in Québec French, which targets final (stressed) vowels even though these vowels are arguably syllabified in open syllables in lexical representations. The second is found in the forms produced by a first language learner of European French, who displays an asymmetry in her production of CVC versus CVCV target (adult) forms. The former display full preservation (with concomitant manner harmony) of both consonants. The latter undergo deletion of the initial syllable if the consonants are not manner-harmonic in the input. We argue that both patterns can be explained through a phonological process of prosodic strengthening targeting the head of the prosodic domain which, in the contexts described above, yields the incorporation of final consonants into the coda of the stressed syllable.
Zhang, Xujin; Samuel, Arthur G.; Liu, Siyun
2011-01-01
Previous research has found that a speaker’s native phonological system has a great influence on perception of another language. In three experiments, we tested the perception and representation of Mandarin phonological contrasts by Guangzhou Cantonese speakers, and compared their performance to that of native Mandarin speakers. Despite their rich experience using Mandarin Chinese, the Cantonese speakers had problems distinguishing specific Mandarin segmental and tonal contrasts that do not exist in Guangzhou Cantonese. However, we found evidence that the subtle differences between two members of a contrast were nonetheless represented in the lexicon. We also found different processing patterns for non-native segmental versus non-native tonal contrasts. The results provide substantial new information about the representation and processing of segmental and prosodic information by individuals listening to a closely-related, very well-learned, but still non-native language. PMID:22707849
Neural Substrates of Processing Anger in Language: Contributions of Prosody and Semantics.
Castelluccio, Brian C; Myers, Emily B; Schuh, Jillian M; Eigsti, Inge-Marie
2016-12-01
Emotions are conveyed primarily through two channels in language: semantics and prosody. While many studies confirm the role of a left hemisphere network in processing semantic emotion, there has been debate over the role of the right hemisphere in processing prosodic emotion. Some evidence suggests a preferential role for the right hemisphere, and other evidence supports a bilateral model. The relative contributions of semantics and prosody to the overall processing of affect in language are largely unexplored. The present work used functional magnetic resonance imaging to elucidate the neural bases of processing anger conveyed by prosody or semantic content. Results showed a robust, distributed, bilateral network for processing angry prosody and a more modest left hemisphere network for processing angry semantics when compared to emotionally neutral stimuli. Findings suggest the nervous system may be more responsive to prosodic cues in speech than to the semantic content of speech.
Cross-linguistic differences in prosodic cues to syntactic disambiguation in German and English
O’Brien, Mary Grantham; Jackson, Carrie N.; Gardner, Christine E.
2012-01-01
This study examined whether late-learning English-German L2 learners and late-learning German-English L2 learners use prosodic cues to disambiguate temporarily ambiguous L1 and L2 sentences during speech production. Experiments 1a and 1b showed that English-German L2 learners and German-English L2 learners used a pitch rise and pitch accent to disambiguate prepositional phrase-attachment sentences in German. However, the same participants, as well as monolingual English speakers, only used pitch accent to disambiguate similar English sentences. Taken together, these results indicate the L2 learners used prosody to disambiguate sentences in both of their languages and did not fully transfer cues to disambiguation from their L1 to their L2. The results have implications for the acquisition of L2 prosody and the interaction between prosody and meaning in L2 production. PMID:24453383
Prosody in the hands of the speaker
Guellaï, Bahia; Langus, Alan; Nespor, Marina
2014-01-01
In everyday life, speech is accompanied by gestures. In the present study, two experiments tested the possibility that spontaneous gestures accompanying speech carry prosodic information. Experiment 1 showed that gestures provide prosodic information, as adults are able to perceive the congruency between low-pass filtered (and thus unintelligible) speech and the gestures of the speaker. Experiment 2 showed that in the case of ambiguous sentences (i.e., sentences with two alternative meanings depending on their prosody), mismatched prosody and gestures lead participants to choose more often the meaning signaled by gestures. Our results demonstrate that the prosody that characterizes speech is not a modality-specific phenomenon: it is also perceived in the spontaneous gestures that accompany speech. We draw the conclusion that spontaneous gestures and speech form a single communication system where the suprasegmental aspects of spoken language are mapped to the motor programs responsible for the production of both speech sounds and hand gestures. PMID:25071666
De Looze, Céline; Moreau, Noémie; Renié, Laurent; Kelly, Finnian; Ghio, Alain; Rico, Audrey; Audoin, Bertrand; Viallet, François; Pelletier, Jean; Petrone, Caterina
2017-05-24
Cognitive impairment (CI) affects 40-65% of patients with multiple sclerosis (MS). CI can have a negative impact on a patient's everyday activities, such as engaging in conversations. Speech production planning ability is crucial for successful verbal interactions and thus for preserving social and occupational skills. This study investigates the effect of cognitive-linguistic demand and CI on speech production planning in MS, as reflected in speech prosody. A secondary aim is to explore the clinical potential of prosodic features for the prediction of an individual's cognitive status in MS. A total of 45 subjects, that is, 22 healthy controls (HC) and 23 patients in early stages of relapsing-remitting MS, underwent neuropsychological tests probing specific cognitive processes involved in speech production planning. All subjects also performed a read speech task, in which they had to read isolated sentences manipulated for phonological length. Results show that the speech of MS patients with CI is mainly affected at the temporal level (articulation and speech rate, pause duration). Regression analyses further indicate that rate measures are correlated with working memory scores. In addition, linear discriminant analysis shows the ROC AUC of identifying MS patients with CI is 0.70 (95% confidence interval: 0.68-0.73). Our findings indicate that prosodic planning is deficient in patients with MS-CI and that the scope of planning depends on patients' cognitive abilities. We discuss how speech-based approaches could be used as an ecological method for the assessment and monitoring of CI in MS. © 2017 The British Psychological Society.
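The ROC AUC reported in the abstract above can be read as the probability that a randomly chosen patient with CI receives a higher classifier score than a randomly chosen patient without CI. The following is a minimal sketch of that pairwise definition only; the function name and the scores are hypothetical illustrations, not the study's data or its analysis pipeline.

```python
def roc_auc(scores_pos, scores_neg):
    """Pairwise ROC AUC: share of (positive, negative) pairs in which
    the positive case scores higher; ties count as half a win."""
    wins = 0.0
    for p in scores_pos:
        for n in scores_neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(scores_pos) * len(scores_neg))


# Hypothetical discriminant scores (assumed for illustration only).
ci_scores = [0.9, 0.4, 0.7, 0.6]      # patients with CI
no_ci_scores = [0.3, 0.5, 0.8, 0.2]   # patients without CI

auc = roc_auc(ci_scores, no_ci_scores)
print(f"ROC AUC = {auc:.2f}")
```

A value of 0.5 corresponds to chance-level separation and 1.0 to perfect separation, which is the scale on which the reported 0.70 should be read.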
Wolff, Susann; Schlesewsky, Matthias; Hirotani, Masako; Bornkessel-Schlesewsky, Ina
2008-11-01
We present two ERP studies on the processing of word order variations in Japanese, a language that is suited to shedding further light on the implications of word order freedom for neurocognitive approaches to sentence comprehension. Experiment 1 used auditory presentation and revealed that initial accusative objects elicit increased processing costs in comparison to initial subjects (in the form of a transient negativity) only when followed by a prosodic boundary. A similar effect was observed using visual presentation in Experiment 2, though only for accusative and not for dative objects. These results support a relational account of word order processing, in which the costs of comprehending an object-initial word order are determined by the linearization properties of the initial object in relation to the linearization properties of possible upcoming arguments. In the absence of a prosodic boundary, the possibility for subject omission in Japanese renders it likely that the initial accusative is the only argument in the clause. Hence, no upcoming arguments are expected and no linearization problem can arise. A prosodic boundary or visual segmentation, by contrast, indicates an object-before-subject word order, thereby leading to a mismatch between argument "prominence" (e.g. in terms of thematic roles) and linear order. This mismatch is alleviated when the initial object is highly prominent itself (e.g. in the case of a dative, which can bear the higher-ranking thematic role in a two-argument relation). We argue that the processing mechanism at work here can be distinguished from more general aspects of "dependency processing" in object-initial sentences.
Contributions of speech science to the technology of man-machine voice interactions
NASA Technical Reports Server (NTRS)
Lea, Wayne A.
1977-01-01
Research in speech understanding was reviewed. Plans which include prosodics research, phonological rules for speech understanding systems, and continued interdisciplinary phonetics research are discussed. Improved acoustic phonetic analysis capabilities in speech recognizers are suggested.
Japanese/Korean Linguistics, Volume 8.
ERIC Educational Resources Information Center
Silva, David J., Ed.
A collection of research in Japanese and Korean linguistics includes: "Repetition, Reformulation, and Definitions: Prosodic Indexes of Elaboration in Japanese" (Mieko Banno); "Projection of Talk Using Language, Intonation, Deictic and Iconic Gestures and Other Body Movements" (Keiko Emmett); "Turn-taking in Japanese…
Auditory-Motor Rhythms and Speech Processing in French and German Listeners
Falk, Simone; Volpi-Moncorger, Chloé; Dalla Bella, Simone
2017-01-01
Moving to a speech rhythm can enhance verbal processing in the listener by increasing temporal expectancies (Falk and Dalla Bella, 2016). Here we tested whether this hypothesis holds for prosodically diverse languages such as German (a lexical stress-language) and French (a non-stress language). Moreover, we examined the relation between motor performance and the benefits for verbal processing as a function of language. Sixty-four participants, 32 German and 32 French native speakers, detected subtle word changes in accented positions in metrically structured sentences to which they previously tapped with their index finger. Before each sentence, they were cued by a metronome to tap either congruently (i.e., to accented syllables) or incongruently (i.e., to non-accented parts) to the following speech stimulus. Both French and German speakers detected words better when cued to tap congruently compared to incongruent tapping. Detection performance was predicted by participants' motor performance in the non-verbal cueing phase. Moreover, tapping rate while participants tapped to speech predicted detection differently for the two language groups, in particular in the incongruent tapping condition. We discuss our findings in light of the rhythmic differences of both languages and with respect to recent theories of expectancy-driven and multisensory speech processing. PMID:28443036
Auditory-prosodic processing in bipolar disorder; from sensory perception to emotion.
Van Rheenen, Tamsyn E; Rossell, Susan L
2013-12-01
Accurate emotion processing is critical to understanding the social world. Despite growing evidence of facial emotion processing impairments in patients with bipolar disorder (BD), comprehensive investigations of emotional prosodic processing are limited. The existing (albeit sparse) literature is inconsistent at best, and confounded by failures to control for the effects of gender or low-level sensory-perceptual impairments. The present study sought to address this paucity of research by utilizing a novel behavioural battery to comprehensively investigate the auditory-prosodic profile of BD. Fifty BD patients and 52 healthy controls completed tasks assessing emotional and linguistic prosody, and sensitivity for discriminating tones that deviate in amplitude, duration and pitch. BD patients were less sensitive than their control counterparts in discriminating amplitude and durational cues but not pitch cues or linguistic prosody. They also demonstrated impaired ability to recognize happy intonations, although this was specific to males with the disorder. The recognition of happy in the patient group was correlated with pitch and amplitude sensitivity in female patients only. The small sample size of patients after stratification by current mood state prevented us from conducting subgroup comparisons between symptomatic, euthymic and control participants to explicitly examine the effects of mood. Our findings indicate the existence of a female advantage for the processing of emotional prosody in BD, specifically for the processing of happy. Although male BD patients were impaired in their ability to recognize happy prosody, this was unrelated to reduced tone discrimination sensitivity. This study indicates the importance of examining both gender and low-order sensory-perceptual capacity when examining emotional prosody. © 2013 Elsevier B.V. All rights reserved.
Musical training shapes neural responses to melodic and prosodic expectation.
Zioga, Ioanna; Di Bernardi Luft, Caroline; Bhattacharya, Joydeep
2016-11-01
Current research on music processing and syntax or semantics in language suggests that music and language share partially overlapping neural resources. Pitch also constitutes a common denominator, forming melody in music and prosody in language. Further, pitch perception is modulated by musical training. The present study investigated how music and language interact on the pitch dimension and whether musical training plays a role in this interaction. For this purpose, we used melodies ending on an expected or unexpected note (melodic expectancy being estimated by a computational model) paired with prosodic utterances which were either expected (statements with falling pitch) or relatively unexpected (questions with rising pitch). Participants' (22 musicians, 20 nonmusicians) ERPs and behavioural responses in a statement/question discrimination task were recorded. Participants were faster for simultaneous expectancy violations in the melodic and linguistic stimuli. Further, musicians performed better than nonmusicians, which may be related to their increased pitch tracking ability. At the neural level, prosodic violations elicited a front-central positive ERP around 150 ms after the onset of the last word/note, while musicians presented reduced P600 in response to strong incongruities (questions on low-probability notes). Critically, musicians' P800 amplitudes were proportional to their level of musical training, suggesting that expertise might shape the pitch processing of language. The beneficial aspect of expertise could be attributed to its strengthening effect on general executive functions. These findings offer novel contributions to our understanding of shared higher-order mechanisms between music and language processing on the pitch dimension, and further demonstrate a potential modulation by musical expertise. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Dysarthria associated with traumatic brain injury: speaking rate and emphatic stress.
Wang, Yu-Tsai; Kent, Ray D; Duffy, Joseph R; Thomas, Jack E
2005-01-01
Prosodic abnormality is common in the dysarthria associated with traumatic brain injury (TBI), and adjustments of speaking rate and emphatic stress are often used as steps in treating the speech disorder in patients with TBI-induced dysarthria. However, studies to date do not present a clear and detailed picture of how speaking rate and emphatic stress are affected in this speech disorder. This study, based on the acoustic analyses of syllable repetitions and sentence speech samples, reports on speaking rate and emphatic stress for 12 subjects with TBI and 8 healthy controls. For speaking rate, the subjects with TBI had (1) both slow speaking and articulation rates, (2) smaller phonation proportion and larger pause proportion, and (3) larger percentage change in speaking rate and smaller percentage change in articulation rate. For emphatic stress, the subjects with TBI had (1) significant increases in the difference and percentage change between pre-stressed and pre-unstressed pause durations, (2) significantly smaller difference between stressed and unstressed word durations, but not the percentage change between stressed and unstressed word durations, and (3) significantly reduced differences in f(0) movement and f(0) slope between stressed and unstressed words, but not in RMS range. This study demonstrates the multidimensional nature of prosodic deficits in the dysarthria related to TBI and illustrates the ability of acoustic measures to give a picture of the dysprosody related to TBI-induced dysarthria. As a result of this activity, the participant will be able to (1) describe the prosodic disturbances that have been reported in studies of dysarthria associated with TBI; (2) define acoustic measures appropriate to the analysis of changes in speaking rate and emphatic stress; and (3) discuss the importance of prosody to spoken communication.
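The rate measures named in the abstract above can be illustrated with a short sketch. It assumes the common definitions (speaking rate counts pause time, articulation rate excludes it) and a simple relative-difference formula for percentage change; the abstract does not state the study's exact formulas, and all function names and numbers below are hypothetical.

```python
def rate_measures(n_syllables, phonation_s, pause_s):
    """Return rate and proportion measures for one speech sample.
    Assumed definitions: speaking rate includes pauses,
    articulation rate excludes them."""
    total_s = phonation_s + pause_s
    return {
        "speaking_rate": n_syllables / total_s,         # syllables/s, pauses included
        "articulation_rate": n_syllables / phonation_s, # syllables/s, pauses excluded
        "phonation_proportion": phonation_s / total_s,
        "pause_proportion": pause_s / total_s,
    }


def percentage_change(baseline, new):
    """Relative change from baseline to new, in percent (assumed formula)."""
    return 100.0 * (new - baseline) / baseline


# Hypothetical samples: the same 20-syllable sentence at two rates.
habitual = rate_measures(n_syllables=20, phonation_s=4.0, pause_s=1.0)
fast = rate_measures(n_syllables=20, phonation_s=3.2, pause_s=0.8)

speaking_change = percentage_change(habitual["speaking_rate"], fast["speaking_rate"])
articulation_change = percentage_change(
    habitual["articulation_rate"], fast["articulation_rate"]
)
print(habitual)
print(f"speaking rate change: {speaking_change:.1f}%")
print(f"articulation rate change: {articulation_change:.1f}%")
```

Keeping phonation and pause time separate is what allows a change in speaking rate to be attributed either to faster articulation or to shorter pauses, the distinction drawn in the abstract's findings.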
Cason, Nia; Astésano, Corine; Schön, Daniele
2015-02-01
Following findings that musical rhythmic priming enhances subsequent speech perception, we investigated whether rhythmic priming for spoken sentences can enhance phonological processing - the building blocks of speech - and whether audio-motor training enhances this effect. Participants heard a metrical prime followed by a sentence (with a matching/mismatching prosodic structure), for which they performed a phoneme detection task. Behavioural (RT) data was collected from two groups: one who received audio-motor training, and one who did not. We hypothesised that 1) phonological processing would be enhanced in matching conditions, and 2) audio-motor training with the musical rhythms would enhance this effect. Indeed, providing a matching rhythmic prime context resulted in faster phoneme detection, thus revealing a cross-domain effect of musical rhythm on phonological processing. In addition, our results indicate that rhythmic audio-motor training enhances this priming effect. These results have important implications for rhythm-based speech therapies, and suggest that metrical rhythm in music and speech may rely on shared temporal processing brain resources. Copyright © 2015 Elsevier B.V. All rights reserved.
Atypical neural responses to vocal anger in attention-deficit/hyperactivity disorder.
Chronaki, Georgia; Benikos, Nicholas; Fairchild, Graeme; Sonuga-Barke, Edmund J S
2015-04-01
Deficits in facial emotion processing, reported in attention-deficit/hyperactivity disorder (ADHD), have been linked to both early perceptual and later attentional components of event-related potentials (ERPs). However, the neural underpinnings of vocal emotion processing deficits in ADHD have yet to be characterised. Here, we report the first ERP study of vocal affective prosody processing in ADHD. Event-related potentials of 6-11-year-old children with ADHD (n = 25) and typically developing controls (n = 25) were recorded as they completed a task measuring recognition of vocal prosodic stimuli (angry, happy and neutral). Audiometric assessments were conducted to screen for hearing impairments. Children with ADHD were less accurate than controls at recognising vocal anger. Relative to controls, they displayed enhanced N100 and attenuated P300 components to vocal anger. The P300 effect was reduced, but remained significant, after controlling for N100 effects by rebaselining. Only the N100 effect was significant when children with ADHD and comorbid conduct disorder (n = 10) were excluded. This study provides the first evidence linking ADHD to atypical neural activity during the early perceptual stages of vocal anger processing. These effects may reflect preattentive hyper-vigilance to vocal anger in ADHD. © 2014 Association for Child and Adolescent Mental Health.
ERIC Educational Resources Information Center
Karins, A. Krisjanis
1995-01-01
Investigates variable deletion of short vowels in word-final unstressed syllables in Latvian spoken in Riga. Affected vowels were almost always inflectional endings and results indicated that internal phonological and prosodic factors (especially distance from main word stress) were the strongest constraints on vowel deletion, along with the…
Past Participle Formation in Specific Language Impairment
ERIC Educational Resources Information Center
Kauschke, Christina; Renner, Lena F.; Domahs, Ulrike
2017-01-01
Background: German participles are formed by a co-occurrence of prefixation and suffixation. While the acquisition of regular and irregular suffixation has been investigated exhaustively, it is still unclear how German children master the prosodically determined prefixation rule (prefix "ge-"). Findings reported in the literature are…
The Sound Pattern of Japanese Surnames
ERIC Educational Resources Information Center
Tanaka, Yu
2017-01-01
Compound surnames in Japanese show complex phonological patterns, which pose challenges to current theories of phonology. This dissertation proposes an account of the segmental and prosodic issues in Japanese surnames and discusses their theoretical implications. Like regular compound words, compound surnames may undergo a sound alternation known…
CUHK Papers in Linguistics, Number 1.
ERIC Educational Resources Information Center
Ching, Teresa, Ed.
1989-01-01
Papers in this volume include the following: "Prosodic Aspects of Hearing-Impaired Children: A Qualitative and Quantitative Assessment" (Teresa Y. C. Ching); "The Role of Linear Order in the Acquisition of Quantifier Scope in Chinese" (Thomas H. T. Lee); "Some Neglected Syntactic Phenomena in Near-standard…
Prosody and Intonation of Western Cham
ERIC Educational Resources Information Center
Ueki, Kaori
2011-01-01
This dissertation investigates the prosodic and intonational characteristics of Western Cham (three letter code for International Organization for Standardization's ISO 639-3 code: [iso=cja]), an Austronesian language in the Chamic sub-group. I examine acoustic variables of prominence at word and postlexical levels: syllable duration, pitch…
Methodological Variables in Choral Reading
ERIC Educational Resources Information Center
Poore, Meredith A.; Ferguson, Sarah Hargus
2008-01-01
This preliminary study explored changes in prosodic variability during choral reading and investigated whether these changes are affected by the method of eliciting choral reading. Ten typical adult talkers recorded three reading materials (poetry, fiction and textbook) in three reading conditions: solo (reading aloud alone), track (reading aloud…
Sperry Univac speech communications technology
NASA Technical Reports Server (NTRS)
Medress, Mark F.
1977-01-01
Technology and systems for effective verbal communication with computers were developed. A continuous speech recognition system for verbal input, a word spotting system to locate key words in conversational speech, prosodic tools to aid speech analysis, and a prerecorded voice response system for speech output are described.
Word Prosody and Intonation of Sgaw Karen
NASA Astrophysics Data System (ADS)
West, Luke Alexander
The prosodic, and specifically intonation, systems of Tibeto-Burman languages have received less attention in research than those of other families. This study investigates the word prosody and intonation of Sgaw Karen, a tonal Tibeto-Burman language of eastern Burma, and finds similarities to both closely related Tibeto-Burman languages and the more distant Sinitic languages like Mandarin. Sentences of varying lengths with controlled tonal environments were elicited from a total of 12 participants (5 male). In terms of word prosody, Sgaw Karen does not exhibit word stress cues, but does maintain a prosodic distinction between the more prominent major syllable and the phonologically reduced minor syllable. In terms of intonation, Sgaw Karen patterns like related Pwo Karen in its limited use of post-lexical tone, which is only present at Intonation Phrase (IP) boundaries. Unlike the intonation systems of Pwo Karen and Mandarin, however, Sgaw Karen exhibits downstep across its Accentual Phrases (AP), similarly to phenomena identified in Tibetan and Burmese.
Fonseca, Rochele Paz; Fachel, Jandyra Maria Guimarães; Chaves, Márcia Lorena Fagundes; Liedtke, Francéia Veiga; Parente, Maria Alice de Mattos Pimenta
2007-01-01
Right-brain-damaged individuals may present discursive, pragmatic, lexical-semantic and/or prosodic disorders. The aim of this study was to verify the effect of right hemisphere damage on communication processing, as evaluated by the Brazilian version of the Protocole Montréal d'Évaluation de la Communication (Montreal Communication Evaluation Battery): Bateria Montreal de Avaliação da Comunicação (Bateria MAC), in Portuguese. A clinical group of 29 right-brain-damaged participants and a control group of 58 non-brain-damaged adults formed the sample. A questionnaire on sociocultural and health aspects was administered together with the Brazilian MAC Battery. Significant differences between the clinical and control groups were observed in the following MAC Battery tasks: conversational discourse; unconstrained, semantic and orthographic verbal fluency; linguistic prosody repetition; and emotional prosody comprehension, repetition and production. Moreover, the clinical group was less homogeneous than the control group. A right-brain-damage effect was identified directly on three communication processes (discursive, lexical-semantic and prosodic) and indirectly on the pragmatic process.
Bagley, Amy D.; Abramowitz, Carolyn S.; Kosson, David S.
2010-01-01
Deficits in emotion processing have been widely reported to be central to psychopathy. However, few prior studies have examined vocal affect recognition in psychopaths, and these studies suffer from significant methodological limitations. Moreover, prior studies have yielded conflicting findings regarding the specificity of psychopaths’ affect recognition deficits. This study examined vocal affect recognition in 107 male inmates under conditions requiring isolated prosodic vs. semantic analysis of affective cues and compared subgroups of offenders identified via cluster analysis on vocal affect recognition. Psychopaths demonstrated deficits in vocal affect recognition under conditions requiring use of semantic cues and conditions requiring use of prosodic cues. Moreover, both primary and secondary psychopaths exhibited relatively similar emotional deficits in the semantic analysis condition compared to nonpsychopathic control participants. This study demonstrates that psychopaths’ vocal affect recognition deficits are not due to methodological limitations of previous studies and provides preliminary evidence that primary and secondary psychopaths exhibit generally similar deficits in vocal affect recognition. PMID:19413412
Predictive processing of novel compounds: evidence from Japanese.
Hirose, Yuki; Mazuka, Reiko
2015-03-01
Our study argues that pre-head anticipatory processing operates at a level below the level of the sentence. A visual-world eye-tracking study demonstrated that, in processing of Japanese novel compounds, the compound structure can be constructed prior to the head if the prosodic information on the preceding modifier constituent signals that the Compound Accent Rule (CAR) is being applied. This prosodic cue rules out the single head analysis of the modifier noun, which would otherwise be a natural and economical choice. Once the structural representation for the head is computed in advance, the parser becomes faster in identifying the compound meaning. This poses a challenge to models maintaining that structural integration and word recognition are separate processes. At the same time, our results, together with previous findings, suggest the possibility that there is some degree of staging during the processing of different sources of information during the comprehension of compound nouns. Copyright © 2014 Elsevier B.V. All rights reserved.
Visual attention modulates brain activation to angry voices.
Mothes-Lasch, Martin; Mentzel, Hans-Joachim; Miltner, Wolfgang H R; Straube, Thomas
2011-06-29
In accordance with influential models proposing prioritized processing of threat, previous studies have shown automatic brain responses to angry prosody in the amygdala and the auditory cortex under auditory distraction conditions. However, it is unknown whether the automatic processing of angry prosody is also observed during cross-modal distraction. The current fMRI study investigated brain responses to angry versus neutral prosodic stimuli during visual distraction. During scanning, participants were exposed to angry or neutral prosodic stimuli while visual symbols were displayed simultaneously. By means of task requirements, participants either attended to the voices or to the visual stimuli. While the auditory task revealed pronounced activation in the auditory cortex and amygdala to angry versus neutral prosody, this effect was absent during the visual task. Thus, our results show a limitation of the automaticity of the activation of the amygdala and auditory cortex to angry prosody. The activation of these areas to threat-related voices depends on modality-specific attention.
Word-initial rhotic clusters in typically developing children: European Portuguese.
Ramalho, Ana Margarida; Freitas, M João
2018-01-01
Rhotic clusters are complex structures segmentally and prosodically and are frequently one of the last structures acquired by Portuguese-speaking children. This paper describes cross-sectional data for word-initial (WI) rhotic tap clusters in typically developing 3-, 4- and 5-year-olds in Portugal. Additional information is provided on WI /l/ as a singleton and in clusters. A native speaker audio-recorded and transcribed single words in a story-telling task. Results for WI rhotic clusters show an age effect consistent with previous research on European Portuguese. Singleton /l/ was in advance of /l/-clusters as expected, but the tap clusters were in advance of the /l/-clusters, possibly reflecting the velarized characteristics of the lateral. The prosodic variables word stress and word length were relevant for the WI rhotic clusters: shorter words and stressed syllables showed higher accuracy. Finally, mismatches ('errors') mainly reflected negative structural constraints (deletion of C2 and epenthesis) rather than segmental constraints (substitutions).
Neural Substrates of Auditory Emotion Recognition Deficits in Schizophrenia.
Kantrowitz, Joshua T; Hoptman, Matthew J; Leitman, David I; Moreno-Ortega, Marta; Lehrfeld, Jonathan M; Dias, Elisa; Sehatpour, Pejman; Laukka, Petri; Silipo, Gail; Javitt, Daniel C
2015-11-04
Deficits in auditory emotion recognition (AER) are a core feature of schizophrenia and a key component of social cognitive impairment. AER deficits are tied behaviorally to impaired ability to interpret tonal ("prosodic") features of speech that normally convey emotion, such as modulations in base pitch (F0M) and pitch variability (F0SD). These modulations can be recreated using synthetic frequency modulated (FM) tones that mimic the prosodic contours of specific emotional stimuli. The present study investigates neural mechanisms underlying impaired AER using a combined event-related potential/resting-state functional connectivity (rsfMRI) approach in 84 schizophrenia/schizoaffective disorder patients and 66 healthy comparison subjects. Mismatch negativity (MMN) to FM tones was assessed in 43 patients/36 controls. rsfMRI between auditory cortex and medial temporal (insula) regions was assessed in 55 patients/51 controls. The relationship between AER, MMN to FM tones, and rsfMRI was assessed in the subset who performed all assessments (14 patients, 21 controls). As predicted, patients showed robust reductions in MMN across FM stimulus type (p = 0.005), particularly to modulations in F0M, along with impairments in AER and FM tone discrimination. MMN source analysis indicated dipoles in both auditory cortex and anterior insula, whereas rsfMRI analyses showed reduced auditory-insula connectivity. MMN to FM tones and functional connectivity together accounted for ∼50% of the variance in AER performance across individuals. These findings demonstrate that impaired preattentive processing of tonal information and reduced auditory-insula connectivity are critical determinants of social cognitive dysfunction in schizophrenia, and thus represent key targets for future research and clinical intervention. 
Schizophrenia patients show deficits in the ability to infer emotion based upon tone of voice [auditory emotion recognition (AER)] that drive impairments in social cognition and global functional outcome. This study evaluated neural substrates of impaired AER in schizophrenia using a combined event-related potential/resting-state fMRI approach. Patients showed impaired mismatch negativity response to emotionally relevant frequency modulated tones along with impaired functional connectivity between auditory and medial temporal (anterior insula) cortex. These deficits contributed in parallel to impaired AER and accounted for ∼50% of variance in AER performance. Overall, these findings demonstrate the importance of both auditory-level dysfunction and impaired auditory/insula connectivity in the pathophysiology of social cognitive dysfunction in schizophrenia. Copyright © 2015 the authors 0270-6474/15/3514910-13$15.00/0.
Tuning of Human Modulation Filters Is Carrier-Frequency Dependent
Simpson, Andrew J. R.; Reiss, Joshua D.; McAlpine, David
2013-01-01
Recent studies employing speech stimuli to investigate ‘cocktail-party’ listening have focused on entrainment of cortical activity to modulations at syllabic (5 Hz) and phonemic (20 Hz) rates. The data suggest that cortical modulation filters (CMFs) are dependent on the sound-frequency channel in which modulations are conveyed, potentially underpinning a strategy for separating speech from background noise. Here, we characterize modulation filters in human listeners using a novel behavioral method. Within an ‘inverted’ adaptive forced-choice increment detection task, listening level was varied whilst contrast was held constant for ramped increments with effective modulation rates between 0.5 and 33 Hz. Our data suggest that modulation filters are tonotopically organized (i.e., vary along the primary, frequency-organized, dimension). This suggests that the human auditory system is optimized to track rapid (phonemic) modulations at high sound-frequencies and slow (prosodic/syllabic) modulations at low frequencies. PMID:24009759
ERIC Educational Resources Information Center
Monash Univ., Clayton, Victoria (Australia).
The present compilation of papers on linguistics is the result of joint efforts by the Classical Studies, French, Japanese, Linguistics, and Russian Departments of Monash University. Selections in the Pre-Prints and Articles section include: "For/Arabic Bilingualism in the Zalingei Area," by B. Jernudd; "Prosodic Problems in a Generative Phonology…
The Style and Structure of "Minnesang"
ERIC Educational Resources Information Center
Oberlin, Adam
2012-01-01
"The Style and Structure of 'Minnesang'" approaches a broad corpus of the medieval German love lyric from the perspective of historical phraseology and formulaicity. Overturning previous concerns of prosodic restriction in verse and the misapplication of contemporary notions of fixity, the dissertation provides an overview of…
Discriminating Dysarthria Type from Envelope Modulation Spectra
ERIC Educational Resources Information Center
Liss, Julie M.; LeGendre, Sue; Lotto, Andrew J.
2010-01-01
Purpose: Previous research demonstrated the ability of temporally based rhythm metrics to distinguish among dysarthrias with different prosodic deficit profiles (J. M. Liss et al., 2009). The authors examined whether comparable results could be obtained by an automated analysis of speech envelope modulation spectra (EMS), which quantifies the…
The Best Question: Explaining the Projection Behavior of Factives
ERIC Educational Resources Information Center
Simons, Mandy; Beaver, David; Roberts, Craige; Tonhauser, Judith
2017-01-01
This article deals with projection in factive sentences. The article first challenges standard assumptions by presenting a series of detailed observations about the interpretations of factive sentences in context, showing that what implication projects, if any, is quite variable and that projection is tightly constrained by prosodic and contextual…
Intonational Phrasing Is Constrained by Meaning, Not Balance
ERIC Educational Resources Information Center
Breen, Mara; Watson, Duane G.; Gibson, Edward
2011-01-01
This paper evaluates two classes of hypotheses about how people prosodically segment utterances: (1) meaning-based proposals, with a focus on Watson and Gibson's (2004) proposal, according to which speakers tend to produce boundaries before and after long constituents; and (2) balancing proposals, according to which speakers tend to produce…
Phonotactic Acquisition in Healthy Preterm Infants
ERIC Educational Resources Information Center
Gonzalez-Gomez, Nayeli; Nazzi, Thierry
2012-01-01
Previous work has shown that preterm infants are at higher risk for cognitive/language delays than full-term infants. Recent studies, focusing on prosody (i.e. rhythm, intonation), have suggested that prosodic perception development in preterms is indexed by maturational rather than postnatal/listening age. However, because prosody is heard…
Perception and Acoustic Correlates of the Taiwanese Tone Sandhi Group
ERIC Educational Resources Information Center
Kuo, Chen-Hsiu
2013-01-01
This dissertation investigates how the Taiwanese Tone Sandhi Groups are perceived, and the acoustic/phonetic correlates of listeners' judgments. A series of perception experiments have been conducted to scrutinize the following topics--Taiwanese tone neutralization, Tone Sandhi Group (TSG) as a prosodic domain, perceived boundary strength in…
Prosody Production and Perception with Conversational Speech
ERIC Educational Resources Information Center
Mo, Yoonsook
2010-01-01
Speech utterances are more than the linear concatenation of individual phonemes or words. They are organized by prosodic structures comprising phonological units of different sizes (e.g., syllable, foot, word, and phrase) and the prominence relations among them. As the linguistic structure of spoken languages, prosody serves an important function…
Dysprosody and Stimulus Effects in Cantonese Speakers with Parkinson's Disease
ERIC Educational Resources Information Center
Ma, Joan K.-Y.; Whitehill, Tara; Cheung, Katherine S.-K.
2010-01-01
Background: Dysprosody is a common feature in speakers with hypokinetic dysarthria. However, speech prosody varies across different types of speech materials. This raises the question of what is the most appropriate speech material for the evaluation of dysprosody. Aims: To characterize the prosodic impairment in Cantonese speakers with…
Computational Prosodic Markers for Autism
ERIC Educational Resources Information Center
Van Santen, Jan P.H.; Prud'hommeaux, Emily T.; Black, Lois M.; Mitchell, Margaret
2010-01-01
We present results obtained with new instrumental methods for the acoustic analysis of prosody to evaluate prosody production by children with Autism Spectrum Disorder (ASD) and Typical Development (TD). Two tasks elicit focal stress--one in a vocal imitation paradigm, the other in a picture-description paradigm; a third task also uses a vocal…
Combining Formal and Functional Approaches to Topic Structure
ERIC Educational Resources Information Center
Zellers, Margaret; Post, Brechtje
2012-01-01
Fragmentation between formal and functional approaches to prosodic variation is an ongoing problem in linguistic research. In particular, the frameworks of the Phonetics of Talk-in-Interaction (PTI) and Empirical Phonology (EP) take very different theoretical and methodological approaches to this kind of variation. We argue that it is fruitful to…
Are Homophones Acoustically Distinguished in Child-Directed Speech?
ERIC Educational Resources Information Center
Conwell, Erin
2017-01-01
Many approaches to early word learning posit that children assume a one-to-one mapping of form and meaning. However, children's early vocabularies contain homophones, words that violate that assumption. Children might learn such words by exploiting prosodic differences between homophone meanings that are associated with lemma frequency (Gahl,…
Early Phonological and Lexical Development and Otitis Media: A Diary Study.
ERIC Educational Resources Information Center
Donahue, Mavis L.
1993-01-01
A child with chronic otitis media with effusion solved the problem of reduced and fluctuating auditory input with phonological selection and avoidance strategies that capitalized on prosodic cues. Findings illustrate the need to consider interactions among performance, input, and linguistic constraints to explain individual variation in language…
Prosodically Driven Metathesis in Mutsun
ERIC Educational Resources Information Center
Butler, Lynnika
2013-01-01
Among the many ways in which sounds alternate in the world's languages, changes in the order of sounds (metathesis) are relatively rare. Mutsun, a Southern Costanoan language of California which was documented extensively before the death of its last speaker in 1930, displays three patterns of synchronic consonant-vowel (CV) metathesis. Two of…
Cross-Modal Facilitation in Speech Prosody
ERIC Educational Resources Information Center
Foxton, Jessica M.; Riviere, Louis-David; Barone, Pascal
2010-01-01
Speech prosody has traditionally been considered solely in terms of its auditory features, yet correlated visual features exist, such as head and eyebrow movements. This study investigated the extent to which visual prosodic features are able to affect the perception of the auditory features. Participants were presented with videos of a speaker…
The Role of the Limbic System in Human Communication.
ERIC Educational Resources Information Center
Lamendella, John T.
Linguistics has chosen as its niche the language component of human communication and, naturally enough, the neurolinguist has concentrated on lateralized language systems of the cerebral hemispheres. However, decoding a speaker's total message requires attention to gestures, facial expressions, and prosodic features, as well as other somatic and…
Prosodic Reversal in Dogrib (Weledeh Dialect)
ERIC Educational Resources Information Center
Jaker, Alessandro Michelangelo
2012-01-01
This thesis presents a comprehensive phonological analysis of the Weledeh dialect of Dogrib, a Northern Athabaskan language spoken in the Northwest Territories, Canada, based on the author's own fieldwork. The phonology of Northern Athabaskan languages, and Dogrib in particular, has to date been regarded as highly irregular, and subject to…
Prenuclear Accentuation in English: Phonetics, Phonology, Information Structure
ERIC Educational Resources Information Center
Bishop, Jason Brandon
2013-01-01
A primary function of prosody in many languages is to convey information structure--the "packaging" of a sentence's content into categories such as "focus", "given" and "topic". In English and other West Germanic languages it is widely assumed that focus is signaled prosodically by the location of a…
The Prosodic Evolution of West Slavic in the Context of the Neo-Acute Stress
ERIC Educational Resources Information Center
Feldstein, Ronald F.
1975-01-01
Because of neo-acute stress--or transferred acute stress--long vowel prosody in West Slavic had a special evolution. Two kinds of long vowel evolution are examined. The nature of transitionality across Slavic territory from tonal opposition to distinctive stress placement is pointed out. (SC)
The Segmental and Suprasegmental Phonology of Fataluku
ERIC Educational Resources Information Center
Heston, Tyler M.
2015-01-01
This dissertation describes the segmental and prosodic phonology of Fataluku (IPA [fataluku], ISO 639-3 ddg), a highly underdocumented Papuan language in East Timor (island Southeast Asia). Fataluku is classified as a member of the Timor-Alor-Pantar language (TAP) family, which currently includes approximately 25 members spread across Timor and…
Development of the Prosodic Features of Infants' Vocalizing.
ERIC Educational Resources Information Center
Lane, Harlan; Sheppard, William
Traditional research methods of recording infant verbal behavior, namely, descriptions by a single observer transcribing the utterances of a single infant in a naturalistic setting, have been inadequate to provide data necessary for modern linguistic analyses. The Center for Research on Language and Language Behavior has undertaken to correct this…
Exploring Dyslexics' Phonological Deficit III: Foreign Speech Perception and Production
ERIC Educational Resources Information Center
Soroli, Efstathia; Szenkovits, Gayaneh; Ramus, Franck
2010-01-01
This study investigates French dyslexic and control adult participants' ability to perceive and produce two different non-native contrasts (one segmental and one prosodic), across several conditions varying short-term memory load. For this purpose, we selected Korean plosive voicing (whose categories conflict with French ones) as the segmental…
Improving Intelligibility: Guided Reflective Journals in Action
ERIC Educational Resources Information Center
Lear, Emmaline L.
2014-01-01
This study explores the effectiveness of guided reflective journals to improve intelligibility in a Japanese higher educational context. Based on qualitative and quantitative methods, the paper evaluates changes in speech over the duration of one semester. In particular, this study focuses on changes in prosodic features such as stress, intonation…
Phonological Phrase Boundaries Constrain Lexical Access I. Adult Data
ERIC Educational Resources Information Center
Christophe, A.; Peperkamp, S.; Pallier, C.; Block, E.; Mehler, J.
2004-01-01
We tested the effect of local lexical ambiguities while manipulating the type of prosodic boundary at which the ambiguity occurred, using French sentences and participants. We observed delayed lexical access when a local lexical ambiguity occurred within a phonological phrase (consistent with previous research; e.g., '[un chat grincheux],'…
Perspectives on Treatment for Communication Deficits Associated with Right Hemisphere Brain Damage
ERIC Educational Resources Information Center
Blake, Margaret Lehman
2007-01-01
Purpose: To describe the current treatment research for communication (prosodic, discourse, and pragmatic) deficits associated with right hemisphere brain damage and to provide suggestions for treatment selection given the paucity of evidence specifically for this population. Method: The discussion covers (a) clinical decision processes and…
Bootstrapping the Syntactic Bootstrapper: Probabilistic Labeling of Prosodic Phrases
ERIC Educational Resources Information Center
Gutman, Ariel; Dautriche, Isabelle; Crabbé, Benoît; Christophe, Anne
2015-01-01
The "syntactic bootstrapping" hypothesis proposes that syntactic structure provides children with cues for learning the meaning of novel words. In this article, we address the question of how children might start acquiring some aspects of syntax before they possess a sizeable lexicon. The study presents two models of early syntax…
Hierarchical Spatiotemporal Dynamics of Speech Rhythm and Articulation
ERIC Educational Resources Information Center
Tilsen, Samuel Edward
2009-01-01
Hierarchy is one of the most important concepts in the scientific study of language. This dissertation aims to understand why we observe hierarchical structures in speech by investigating the cognitive processes from which they emerge. To that end, the dissertation explores how articulatory, rhythmic, and prosodic patterns of speech interact.…
Evidence for Prosody in Silent Reading
ERIC Educational Resources Information Center
Gross, Jennifer; Millett, Amanda L.; Bartek, Brian; Bredell, Kyle Hampton; Winegard, Bo
2014-01-01
English speakers and expressive readers emphasize new content in an ongoing discourse. Do silent readers emphasize new content in their inner voice? Because the inner voice cannot be directly observed, we borrowed the cap-emphasis technique (e.g., "toMAYto") from the pronunciation guides of dictionaries to elicit prosodic emphasis.…
Native and Nonnative Processing of Japanese Pitch Accent
ERIC Educational Resources Information Center
Wu, Xianghua; Tu, Jung-Yueh; Wang, Yue
2012-01-01
The theoretical framework of this study is based on the prevalent debate of whether prosodic processing is influenced by higher level linguistic-specific circuits or reflects lower level encoding of physical properties. Using the dichotic listening technique, the study investigates the hemispheric processing of Japanese pitch accent by native…
Planchou, Clément; Clément, Sylvain; Béland, Renée; Cason, Nia; Motte, Jacques; Samson, Séverine
2015-01-01
Background: Previous studies have reported that children score better in language tasks using sung rather than spoken stimuli. We examined word detection ease in sung and spoken sentences that were equated for phoneme duration and pitch variations in children aged 7 to 12 years with typical language development (TLD) as well as in children with specific language impairment (SLI), and hypothesized that the facilitation effect would vary with language abilities. Method: In Experiment 1, 69 children with TLD (7–10 years old) detected words in sentences that were spoken, sung on pitches extracted from speech, and sung on original scores. In Experiment 2, we added a natural speech rate condition and tested 68 children with TLD (7–12 years old). In Experiment 3, 16 children with SLI and 16 age-matched children with TLD were tested in all four conditions. Results: In both TLD groups, older children scored better than the younger ones. The matched TLD group scored higher than the SLI group, who scored at the level of the younger children with TLD. None of the experiments showed a facilitation effect of sung over spoken stimuli. Conclusions: Word detection abilities improved with age in both TLD and SLI groups. Our findings are compatible with the hypothesis of delayed language abilities in children with SLI, and are discussed in light of the role of durational prosodic cues in word detection. PMID:26767070
Emergence of Japanese Infants' Prosodic Preferences in Infant-Directed Vocabulary
ERIC Educational Resources Information Center
Hayashi, Akiko; Mazuka, Reiko
2017-01-01
The article examines the role of infant-directed vocabulary (IDV) in infants' language acquisition, specifically addressing the question of whether IDV forms that are not prominent in adult language may nonetheless be useful to the process of acquisition. Japanese IDV offers a good test case, as IDV characteristically takes a bisyllabic…
ERIC Educational Resources Information Center
Doi, Hirokazu; Fujisawa, Takashi X.; Kanai, Chieko; Ohta, Haruhisa; Yokoi, Hideki; Iwanami, Akira; Kato, Nobumasa; Shinohara, Kazuyuki
2013-01-01
This study investigated the ability of adults with Asperger syndrome to recognize emotional categories of facial expressions and emotional prosodies with graded emotional intensities. The individuals with Asperger syndrome showed poorer recognition performance for angry and sad expressions from both facial and vocal information. The group…
Comparing Measures of Voice Quality from Sustained Phonation and Continuous Speech
ERIC Educational Resources Information Center
Gerratt, Bruce R.; Kreiman, Jody; Garellek, Marc
2016-01-01
Purpose: The question of what type of utterance--a sustained vowel or continuous speech--is best for voice quality analysis has been extensively studied but with equivocal results. This study examines whether previously reported differences derive from the articulatory and prosodic factors occurring in continuous speech versus sustained phonation.…
ERIC Educational Resources Information Center
Kakouros, Sofoklis; Räsänen, Okko
2016-01-01
Numerous studies have examined the acoustic correlates of sentential stress and its underlying linguistic functionality. However, the mechanism that connects stress cues to the listener's attentional processing has remained unclear. Also, the learnability versus innateness of stress perception has not been widely discussed. In this work, we…
Training with Rhythmic Beat Gestures Benefits L2 Pronunciation in Discourse-Demanding Situations
ERIC Educational Resources Information Center
Gluhareva, Daria; Prieto, Pilar
2017-01-01
Recent research has shown that beat gestures (hand gestures that co-occur with speech in spontaneous discourse) are temporally integrated with prosodic prominence and that they help word memorization and discourse comprehension. However, little is known about the potential beneficial effects of beat gestures in second language (L2) pronunciation…
Neurology of Affective Prosody and Its Functional-Anatomic Organization in Right Hemisphere
ERIC Educational Resources Information Center
Ross, Elliott D.; Monnot, Marilee
2008-01-01
Unlike the aphasic syndromes, the organization of affective prosody in brain has remained controversial because affective-prosodic deficits may occur after left or right brain damage. However, different patterns of deficits are observed following left and right brain damage that suggest affective prosody is a dominant and lateralized function of…
Prosodic Structure Shapes the Temporal Realization of Intonation and Manual Gesture Movements
ERIC Educational Resources Information Center
Esteve-Gibert, Nuria; Prieto, Pilar
2013-01-01
Purpose: Previous work on the temporal coordination between gesture and speech found that the prominence in gesture coordinates with speech prominence. In this study, the authors investigated the anchoring regions in speech and pointing gesture that align with each other. The authors hypothesized that (a) in contrastive focus conditions, the…
Beyond the Particular: Prosody and the Coordination of Actions
ERIC Educational Resources Information Center
Szczepek Reed, Beatrice
2012-01-01
The majority of research on prosody in conversation to date has focused on exploring the role of individual prosodic features, such as certain types of pitch accent, pitch register or voice quality, for the accomplishment of specified social actions. From this research the picture emerges that when it comes to the implementation of specific…
Perception and Production of English Lexical Stress by Thai Speakers
ERIC Educational Resources Information Center
Jangjamras, Jirapat
2011-01-01
This study investigated the effects of first language prosodic transfer on the perception and production of English lexical stress and the relation between stress perception and production by second language learners. To test the effect of Thai tonal distribution rules and stress patterns on native Thai speakers' perception and production of…
Prosody and Animacy in the Development of Noun Determiner Use: A Cross-Linguistic Approach
ERIC Educational Resources Information Center
Bassano, Dominique; Korecky-Kröll, Katharina; Maillochon, Isabelle; van Dijk, Marijn; Laaha, Sabine; van Geert, Paul; Dressler, Wolfgang U.
2013-01-01
This study investigates prosodic (noun length) and lexical-semantic (animacy) influences on determiner use in the spontaneous speech of three children acquiring French, Austrian German and Dutch. In support of typological and language-specific hypotheses from the Germanic-Romance contrast, an advantage of monosyllabic nouns and of inanimate nouns…
ERIC Educational Resources Information Center
Holliman, A. J.; Williams, G. J.; Mundy, I. R.; Wood, C.; Hart, L.; Waldron, S.
2014-01-01
A growing number of studies now suggest that sensitivity to the rhythmic patterning of speech (prosody) is implicated in successful reading acquisition. However, recent evidence suggests that prosody is not a unitary construct and that the different components of prosody (stress, intonation, and timing) operating at different linguistic levels…
The Role of Intonation in Language Discrimination by Infants and Adults
ERIC Educational Resources Information Center
Vicenik, Chad Joseph
2011-01-01
It has been widely shown that infants and adults are capable of using only prosodic information to discriminate between languages. However, it remains unclear which aspects of prosody, either rhythm or intonation, listeners attend to for language discrimination. Previous researchers have suggested that rhythm, the duration and timing of speech…
Prosodic Transfer: From Chinese Lexical Tone to English Pitch Accent
ERIC Educational Resources Information Center
Ploquin, Marie
2013-01-01
Chinese tones are associated with a syllable to convey meaning, whereas English pitch accents are prominence markers associated with stressed syllables. As both are created by pitch modulation, their pitch contours can be quite similar. The experiment reported here examines whether native speakers of Chinese produce, when speaking English, the Chinese…
ERIC Educational Resources Information Center
Wang, Wei
2017-01-01
This study investigates Mandarin discourse markers from both functional and prosodic perspectives. Discourse markers are defined as sequentially dependent elements which bracket units of talk (Schiffrin 1987). In this study, I focus on three discourse markers, "ranhou" "then", "wo juede" "I think/feel", and…
ERIC Educational Resources Information Center
Esteve-Gibert, Nuria; Prieto, Pilar
2013-01-01
There is considerable debate about whether early vocalizations mimic the target language and whether prosody signals emergent intentional communication. A longitudinal corpus of four Catalan-babbling infants was analyzed to investigate whether children use different prosodic patterns to distinguish communicative from investigative vocalizations…
Grammars Leak: Modeling How Phonotactic Generalizations Interact within the Grammar
ERIC Educational Resources Information Center
Martin, Andrew
2011-01-01
I present evidence from Navajo and English that weaker, gradient versions of morpheme-internal phonotactic constraints, such as the ban on geminate consonants in English, hold even across prosodic word boundaries. I argue that these lexical biases are the result of a MAXIMUM ENTROPY phonotactic learning algorithm that maximizes the probability of…
ERIC Educational Resources Information Center
Anderson, Alida; Lin, Candise Y.; Wang, Min
2013-01-01
Children with reading disability and normal reading development were compared in their ability to discriminate native (English) and novel language (Mandarin) from nonlinguistic sounds. Children's preference for native versus novel language sounds and for disyllables containing dominant trochaic versus non-dominant iambic stress patterns was also…
Two Arts. Revision and What It Leaves behind
ERIC Educational Resources Information Center
Booten, Kyle
2012-01-01
Inspired by an experience of teaching the drafts of Elizabeth Bishop's "One Art", this article rereads the drafts as far more than imperfect precursors to the final poem. The drafts have their own prosodic features and poetic logic, one that values and enacts a vertiginous dilation of thought, expression and memory. The final version of…
Constituent Length Affects Prosody and Processing for a Dative NP Ambiguity in Korean
ERIC Educational Resources Information Center
Hwang, Hyekyung; Schafer, Amy J.
2009-01-01
Two sentence processing experiments on a dative NP ambiguity in Korean demonstrate effects of phrase length on overt and implicit prosody. Both experiments controlled non-prosodic length factors by using long versus short proper names that occurred before the syntactically critical material. Experiment 1 found that long phrases induce different…
Say It like You Mean It: Mothers' Use of Prosody to Convey Word Meaning
ERIC Educational Resources Information Center
Herold, Debora S.; Nygaard, Lynne C.; Namy, Laura L.
2012-01-01
Prosody plays a variety of roles in infants' communicative development, aiding in attention modulation, speech segmentation, and syntax acquisition. This study investigates the extent to which parents also spontaneously modulate prosodic aspects of infant directed speech in ways that distinguish semantic aspects of language. Fourteen mothers of…
Prosodic and Lexical Marking of Contrast in L2 Italian
ERIC Educational Resources Information Center
Turco, Giuseppina; Dimroth, Christine; Braun, Bettina
2015-01-01
We investigated the second language (L2) acquisition of pragmatic categories that are not as consistently and frequently encoded in the L2 as in the first language (L1). Experiment 1 showed that Italian speakers linguistically highlighted affirmative polarity contrast (e.g. "The child ate the candies" following "The child…
ERIC Educational Resources Information Center
So, Connie K.; Best, Catherine T.
2014-01-01
This study examined how native speakers of Australian English and French, nontone languages with different lexical stress properties, perceived Mandarin tones in a sentence environment according to their native sentence intonation categories (i-Categories) in connected speech. Results showed that both English and French speakers categorized…
Time-Driven Effects on Parsing during Reading
ERIC Educational Resources Information Center
Roll, Mikael; Lindgren, Magnus; Alter, Kai; Horne, Merle
2012-01-01
The phonological trace of perceived words starts fading away in short-term memory after a few seconds. Spoken utterances are usually 2-3 s long, possibly to allow the listener to parse the words into coherent prosodic phrases while they still have a clear representation. Results from this brain potential study suggest that even during silent…
Responses to Intensity-Shifted Auditory Feedback during Running Speech
ERIC Educational Resources Information Center
Patel, Rupal; Reilly, Kevin J.; Archibald, Erin; Cai, Shanqing; Guenther, Frank H.
2015-01-01
Purpose: Responses to intensity perturbation during running speech were measured to understand whether prosodic features are controlled in an independent or integrated manner. Method: Nineteen English-speaking healthy adults (age range = 21-41 years) produced 480 sentences in which emphatic stress was placed on either the 1st or 2nd word. One…
Double Consonants in English: Graphemic, Morphological, Prosodic and Etymological Determinants
ERIC Educational Resources Information Center
Berg, Kristian
2016-01-01
What determines consonant doubling in English? This question is pursued by using a large lexical database to establish systematic correlations between spelling, phonology and morphology. The main insights are: Consonant doubling is most regular at morpheme boundaries. It can be described in graphemic terms alone, i.e. without reference to…
ERIC Educational Resources Information Center
Bouchon, Camillia; Floccia, Caroline; Fux, Thibaut; Adda-Decker, Martine; Nazzi, Thierry
2015-01-01
Consonants and vowels differ acoustically and articulatorily, but also functionally: Consonants are more relevant for lexical processing, and vowels for prosodic/syntactic processing. These functional biases could be powerful bootstrapping mechanisms for learning language, but their developmental origin remains unclear. The relative importance of…
Prosodic Disambiguation of Noun/Verb Homophones in Child-Directed Speech
ERIC Educational Resources Information Center
Conwell, Erin
2017-01-01
One strategy that children might use to sort words into grammatical categories such as noun and verb is distributional bootstrapping, in which local co-occurrence information is used to distinguish between categories. Words that can be used in more than one grammatical category could be problematic for this approach. Using naturalistic corpus…
ERIC Educational Resources Information Center
Monaghan, Padraic; Christiansen, Morten H.; Chater, Nick
2007-01-01
Several phonological and prosodic properties of words have been shown to relate to differences between grammatical categories. Distributional information about grammatical categories is also a rich source in the child's language environment. In this paper we hypothesise that such cues operate in tandem for developing the child's knowledge about…
La forme en -l en macedonien (The Form Suffixed -l in Macedonian).
ERIC Educational Resources Information Center
Hristova, Doreana
The Macedonian verb form corresponding to the form ending in "-l" in French is examined, focusing on the active past participle, which represents the past indeterminate or non-testimonial tense. Special attention is paid to aspectual, modal, temporal, and prosodic values, and all examples are drawn from the two languages. (MSE)
Error Patterns in Young German Children's "Wh"-Questions
ERIC Educational Resources Information Center
Schmerse, Daniel; Lieven, Elena; Tomasello, Michael
2013-01-01
In this article we report two studies: a detailed longitudinal analysis of errors in "wh"-questions from six German-learning children (age 2;0-3;0) and an analysis of the prosodic characteristics of "wh"-questions in German child-directed speech. The results of the first study demonstrate that German-learning children…
Telehealth Delivery of Rapid Syllable Transitions (ReST) Treatment for Childhood Apraxia of Speech
ERIC Educational Resources Information Center
Thomas, Donna C.; McCabe, Patricia; Ballard, Kirrie J.; Lincoln, Michelle
2016-01-01
Background: Rapid Syllable Transitions (ReST) treatment uses pseudo-word targets with varying lexical stress to target simultaneously articulation, prosodic accuracy and coarticulatory transitions in childhood apraxia of speech (CAS). The treatment is efficacious for the acquisition of imitated pseudo-words, and generalization of skill to…
Fonseca, Rochele Paz; Fachel, Jandyra Maria Guimarães; Chaves, Márcia Lorena Fagundes; Liedtke, Francéia Veiga; Parente, Maria Alice de Mattos Pimenta
2007-01-01
Right-brain-damaged individuals may present discursive, pragmatic, lexical-semantic and/or prosodic disorders. Objective: To verify the effect of right hemisphere damage on communication processing evaluated by the Brazilian version of the Protocole Montréal d’Évaluation de la Communication (Montreal Communication Evaluation Battery) – Bateria Montreal de Avaliação da Comunicação, Bateria MAC, in Portuguese. Methods: A clinical group of 29 right-brain-damaged participants and a control group of 58 non-brain-damaged adults formed the sample. A questionnaire on sociocultural and health aspects, together with the Brazilian MAC Battery was administered. Results: Significant differences between the clinical and control groups were observed in the following MAC Battery tasks: conversational discourse, unconstrained, semantic and orthographic verbal fluency, linguistic prosody repetition, emotional prosody comprehension, repetition and production. Moreover, the clinical group was less homogeneous than the control group. Conclusions: A right-brain-damage effect was identified directly on three communication processes: discursive, lexical-semantic and prosodic processes, and indirectly on pragmatic process. PMID:29213400
Emotion to emotion speech conversion in phoneme level
NASA Astrophysics Data System (ADS)
Bulut, Murtaza; Yildirim, Serdar; Busso, Carlos; Lee, Chul Min; Kazemzadeh, Ebrahim; Lee, Sungbok; Narayanan, Shrikanth
2004-10-01
The ability to synthesize emotional speech can make human-machine interaction more natural in spoken dialogue management. This study investigates the effectiveness of prosodic and spectral modification at the phoneme level for emotion-to-emotion speech conversion. The prosody modification is performed with the TD-PSOLA algorithm (Moulines and Charpentier, 1990). We also transform the spectral envelopes of source phonemes to match those of target phonemes using an LPC-based spectral transformation approach (Kain, 2001). Prosodic speech parameters (F0, duration, and energy) for target phonemes are estimated from the statistics obtained from the analysis of an emotional speech database of happy, angry, sad, and neutral utterances collected from actors. Listening experiments conducted with native American English speakers indicate that the modification of prosody only or spectrum only is not sufficient to elicit targeted emotions. The simultaneous modification of both prosody and spectrum results in higher acceptance rates of target emotions, suggesting that modeling speech prosody and modeling spectral patterns that reflect underlying speech articulations are equally important for synthesizing emotional speech with good quality. We are investigating suprasegmental level modifications for further improvement in speech quality and expressiveness.
Is Mandarin Chinese a Truth-Based Language? Rejecting Responses to Negative Assertions and Questions
Li, Feifei; González-Fuente, Santiago; Prieto, Pilar; Espinal, M. Teresa
2016-01-01
This paper addresses the central question of whether Mandarin Chinese (MC) is a canonical truth-based language, a language that is expected to express the speaker's disagreement to a negative proposition by means of a negative particle followed by a positive sentence. Eight native speakers of MC participated in an oral Discourse Completion Task that elicited rejecting responses to negative assertions/questions and broad focus statements (control condition). Results show that MC speakers convey reject by relying on a combination of lexico-syntactic strategies (e.g., negative particles such as bù, méi(yǒu), and positive sentences) together with prosodic (e.g., mean pitch) and gestural strategies (mainly, the use of head nods). Importantly, the use of a negative particle, which was the expected outcome in truth-based languages, only appeared in 52% of the rejecting answers. This system puts into question the macroparametric division between truth-based and polarity-based languages and calls for a more general view of the instantiation of a reject speech act that integrates lexical and syntactic strategies with prosodic and gestural strategies. PMID:28066292
Sallat, Stephan; Jentschke, Sebastian
2015-01-01
Language and music share many properties, with a particularly strong overlap for prosody. Prosodic cues are generally regarded as crucial for language acquisition. Previous research has indicated that children with SLI fail to make use of these cues. As processing of prosodic information involves similar skills to those required in music perception, we compared music perception skills (melodic and rhythmic-melodic perception and melody recognition) in a group of children with SLI (N = 29, five-year-olds) to two groups of controls, either of comparable age (N = 39, five-year-olds) or of age closer to the children with SLI in their language skills and about one year younger (N = 13, four-year-olds). Children with SLI performed in most tasks below their age level, closer matching the performance level of younger controls with similar language skills. These data strengthen the view of a strong relation between language acquisition and music processing. This might open a perspective for the possible use of musical material in early diagnosis of SLI and of music in SLI therapy. PMID:26508812
Doi, Hirokazu; Fujisawa, Takashi X; Kanai, Chieko; Ohta, Haruhisa; Yokoi, Hideki; Iwanami, Akira; Kato, Nobumasa; Shinohara, Kazuyuki
2013-09-01
This study investigated the ability of adults with Asperger syndrome to recognize emotional categories of facial expressions and emotional prosodies with graded emotional intensities. The individuals with Asperger syndrome showed poorer recognition performance for angry and sad expressions from both facial and vocal information. The group difference in facial expression recognition was prominent for stimuli with low or intermediate emotional intensities. In contrast to this, the individuals with Asperger syndrome exhibited lower recognition accuracy than typically-developed controls mainly for emotional prosody with high emotional intensity. In facial expression recognition, Asperger and control groups showed an inversion effect for all categories. The magnitude of this effect was less in the Asperger group for angry and sad expressions, presumably attributable to reduced recruitment of the configural mode of face processing. The individuals with Asperger syndrome outperformed the control participants in recognizing inverted sad expressions, indicating enhanced processing of local facial information representing sad emotion. These results suggest that the adults with Asperger syndrome rely on modality-specific strategies in emotion recognition from facial expression and prosodic information.
Prosody Predicts Contest Outcome in Non-Verbal Dialogs
Dreiss, Amélie N.; Chatelain, Philippe G.; Roulin, Alexandre; Richner, Heinz
2016-01-01
Non-verbal communication has important implications for inter-individual relationships and negotiation success. However, to what extent humans can spontaneously use rhythm and prosody as a sole communication tool is largely unknown. We analysed human ability to resolve a conflict without verbal dialogs, independently of semantics. We invited pairs of subjects to communicate non-verbally using whistle sounds. Along with the production of more whistles, participants unwittingly used a subtle prosodic feature to compete over a resource (ice-cream scoops). Winners can be identified by their propensity to accentuate the first whistles blown when replying to their partner, compared to the following whistles. Naive listeners correctly identified this prosodic feature as a key determinant of which whistler won the interaction. These results suggest that in the absence of other communication channels, individuals spontaneously use a subtle variation of sound accentuation (prosody), instead of merely producing exuberant sounds, to impose themselves in a conflict of interest. We discuss the biological and cultural bases of this ability and their link with verbal communication. Our results highlight the human ability to use non-verbal communication in a negotiation process. PMID:27907039
The roles of long-term phonotactic and lexical prosodic knowledge in phonological short-term memory.
Tanida, Yuki; Ueno, Taiji; Lambon Ralph, Matthew A; Saito, Satoru
2015-04-01
Many previous studies have explored and confirmed the influence of long-term phonological representations on phonological short-term memory. In most investigations, phonological effects have been explored with respect to phonotactic constraints or frequency. If interaction between long-term memory and phonological short-term memory is a generalized principle, then other phonological characteristics-that is, suprasegmental aspects of phonology-should also exert similar effects on phonological short-term memory. We explored this hypothesis through three immediate serial-recall experiments that manipulated Japanese nonwords with respect to lexical prosody (pitch-accent type, reflecting suprasegmental characteristics) as well as phonotactic frequency (reflecting segmental characteristics). The results showed that phonotactic frequency affected the retention not only of the phonemic sequences, but also of pitch-accent patterns, when participants were instructed to recall both the phoneme sequence and accent pattern of nonwords. In addition, accent pattern typicality influenced the retention of the accent pattern: Typical accent patterns were recalled more accurately than atypical ones. These results indicate that both long-term phonotactic and lexical prosodic knowledge contribute to phonological short-term memory performance.
Prosody perception and musical pitch discrimination in adults using cochlear implants.
Kalathottukaren, Rose Thomas; Purdy, Suzanne C; Ballard, Elaine
2015-07-01
This study investigated prosodic perception and musical pitch discrimination in adults using cochlear implants (CI), and examined the relationship between prosody perception scores and non-linguistic auditory measures, demographic variables, and speech recognition scores. Participants were given four subtests of the PEPS-C (profiling elements of prosody in speech-communication), the adult paralanguage subtest of the DANVA 2 (diagnostic analysis of non verbal accuracy 2), and the contour and interval subtests of the MBEA (Montreal battery of evaluation of amusia). Twelve CI users aged 25;5 to 78;0 years participated. CI participants performed significantly more poorly than normative values for New Zealand adults for PEPS-C turn-end, affect, and contrastive stress reception subtests, but were not different from the norm for the chunking reception subtest. Performance on the DANVA 2 adult paralanguage subtest was lower than the normative mean reported by Saindon (2010). Most of the CI participants performed at chance level on both MBEA subtests. CI users have difficulty perceiving prosodic information accurately. Difficulty in understanding different aspects of prosody and music may be associated with reduced pitch perception ability.
Leong, Victoria; Goswami, Usha
2014-02-01
Developmental dyslexia is associated with rhythmic difficulties, including impaired perception of beat patterns in music and prosodic stress patterns in speech. Spoken prosodic rhythm is cued by slow (<10 Hz) fluctuations in speech signal amplitude. Impaired neural oscillatory tracking of these slow amplitude modulation (AM) patterns is one plausible source of impaired rhythm tracking in dyslexia. Here, we characterise the temporal profile of the dyslexic rhythm deficit by examining rhythmic entrainment at multiple speech timescales. Adult dyslexic participants completed two experiments aimed at testing the perception and production of speech rhythm. In the perception task, participants tapped along to the beat of 4 metrically-regular nursery rhyme sentences. In the production task, participants produced the same 4 sentences in time to a metronome beat. Rhythmic entrainment was assessed using both traditional rhythmic indices and a novel AM-based measure, which utilised 3 dominant AM timescales in the speech signal each associated with a different phonological grain-sized unit (0.9-2.5 Hz, prosodic stress; 2.5-12 Hz, syllables; 12-40 Hz, phonemes). The AM-based measure revealed atypical rhythmic entrainment by dyslexic participants to syllable patterns in speech, in perception and production. In the perception task, both groups showed equally strong phase-locking to Syllable AM patterns, but dyslexic responses were entrained to a significantly earlier oscillatory phase angle than controls. In the production task, dyslexic utterances showed shorter syllable intervals, and differences in Syllable:Phoneme AM cross-frequency synchronisation. Our data support the view that rhythmic entrainment at slow (∼5 Hz, Syllable) rates is atypical in dyslexia, suggesting that neural mechanisms for syllable perception and production may also be atypical. 
These syllable timing deficits could contribute to the atypical development of phonological representations for spoken words, the central cognitive characteristic of developmental dyslexia across languages. Copyright © 2013 The Authors. Published by Elsevier B.V. All rights reserved.
Implicit Prosody and Cue-based Retrieval: L1 and L2 Agreement and Comprehension during Reading
Pratt, Elizabeth; Fernández, Eva M.
2016-01-01
This project focuses on structural and prosodic effects during reading, examining their influence on agreement processing and comprehension in native English (L1) and Spanish–English bilingual (L2) speakers. We consolidate research from several distinct areas of inquiry—cognitive processing, reading fluency, and L1/L2 processing—in order to support the integration of prosody with a cue-based retrieval mechanism for subject-verb agreement. To explore this proposal, the experimental design manipulated text presentation to influence implicit prosody, using sentences designed to induce subject-verb agreement attraction errors. Materials included simple and complex relative clauses with head nouns and verbs that were either matched or mismatched for number. Participants read items in one of three presentation formats (whole sentence, word-by-word, or phrase-by-phrase), rated each item for grammaticality, and responded to a comprehension probe. Results indicated that while overall, message comprehension was prioritized over subject-verb agreement computation, presentation format differentially affected both measures in the L1 and L2 groups. For the L1 participants, facilitating the projection of phrasal prosody onto text (phrase-by-phrase presentation) enhanced performance in agreement processing, while disrupting prosodic projection via word-by-word presentation decreased comprehension accuracy. For the L2 participants, however, phrase-by-phrase presentation was not significantly beneficial for agreement processing, and additionally resulted in lower comprehension accuracy. These differences point to a significant role of prosodic phrasing during agreement processing in both L1 and L2 speakers, additionally suggesting that it may contribute to a cue-based retrieval agreement model, either acting as a cue directly, or otherwise scaffolding the retrieval process. 
The discussion and results presented provide support both for a cue-based retrieval mechanism in agreement, and the function of prosody within such a mechanism, adding further insight into the interaction of retrieval processes, cognitive task load, and the role of implicit prosody. PMID:28018264
Neural measures of the role of affective prosody in empathy for pain.
Meconi, Federica; Doro, Mattia; Lomoriello, Arianna Schiano; Mastrella, Giulia; Sessa, Paola
2018-01-10
Emotional communication often needs the integration of affective prosodic and semantic components from speech and the speaker's facial expression. Affective prosody may have a special role by virtue of its dual nature: pre-verbal on one side and accompanying semantic content on the other. This consideration led us to hypothesize that it could act transversely, encompassing a wide temporal window involving the processing of facial expressions and semantic content expressed by the speaker. This would allow powerful communication in contexts of potential urgency such as witnessing the speaker's physical pain. Seventeen participants were shown faces preceded by verbal reports of pain. Facial expressions, intelligibility of the semantic content of the report (i.e., participants' mother tongue vs. fictional language) and the affective prosody of the report (neutral vs. painful) were manipulated. We monitored event-related potentials (ERPs) time-locked to the onset of the faces as a function of semantic content intelligibility and affective prosody of the verbal reports. We found that affective prosody may interact with facial expressions and semantic content in two successive temporal windows, supporting its role as a transverse communication cue.
Why Do Children Pay More Attention to Grammatical Morphemes at the Ends of Sentences?
ERIC Educational Resources Information Center
Sundara, Megha
2018-01-01
Children pay more attention to the beginnings and ends of sentences rather than the middle. In natural speech, ends of sentences are prosodically and segmentally enhanced; they are also privileged by sensory and recall advantages. We contrasted whether acoustic enhancement or sensory and recall-related advantages are necessary and sufficient for…
The Influence of Gujarati and Tamil L1s on Indian English: A Preliminary Study
ERIC Educational Resources Information Center
Wiltshire, Caroline R.; Harnsberger, James D.
2006-01-01
English as spoken as a second language in India has developed distinct sound patterns in terms of both segmental and prosodic characteristics. We investigate the differences between two groups varying in native language (Gujarati, Tamil) to evaluate to what extent Indian English (IE) accents are based on a single target phonological-phonetic…
A Cross-Sectional Study of Fluency and Reading Comprehension in Spanish Primary School Children
ERIC Educational Resources Information Center
Calet, Nuria; Gutiérrez-Palma, Nicolás; Defior, Sylvia
2015-01-01
The importance of prosodic elements is recognised in most definitions of fluency. Although speed and accuracy have been typically considered the constituents of reading fluency, prosody is emerging as an additional component. The relevance of prosody in comprehension is increasingly recognised in the latest studies. The purpose of this research is…
Effects of Prosodically Modulated Sub-Phonetic Variation on Lexical Competition
ERIC Educational Resources Information Center
Salverda, Anne Pier; Dahan, Delphine; Tanenhaus, Michael K.; Crosswhite, Katherine; Masharov, Mikhail; McDonough, Joyce
2007-01-01
Eye movements were monitored as participants followed spoken instructions to manipulate one of four objects pictured on a computer screen. Target words occurred in utterance-medial (e.g., "Put the cap next to the square") or utterance-final position (e.g., "Now click on the cap"). Displays consisted of the target picture (e.g., a cap), a…
Perception of Mandarin Tones: The Effect of L1 Background and Training
ERIC Educational Resources Information Center
Wang, Xinchun
2013-01-01
This study investigates whether native Hmong speakers' first language (L1) lexical tone experience facilitates or interferes with their perception of Mandarin tones and whether training is effective for perceptual learning of second (L2) tones. In Experiment 1, 3 groups of beginning level learners of Mandarin with different L1 prosodic background…
ERIC Educational Resources Information Center
Delgado Algarra, Emilio José
2016-01-01
Most studies focused on the teaching of foreign languages indicate that little attention is paid to prosodic features in both didactic materials and teaching-learning processes (Martinsen, Avord and Tanner, 2014). In this context and throughout this article, an analysis of the didactical and technical dimensions of OJAD (Japanese Accent…
Lexical Encoding of L2 Tones: The Role of L1 Stress, Pitch Accent and Intonation
ERIC Educational Resources Information Center
Braun, Bettina; Galts, Tobias; Kabak, Baris
2014-01-01
Native language prosodic structure is known to modulate the processing of non-native suprasegmental information. It has been shown that native speakers of French, a language without lexical stress, have difficulties storing non-native stress contrasts. We investigated whether the ability to store lexical tone (as in Mandarin Chinese) also depends…
ERIC Educational Resources Information Center
Ménard, Lucie; Leclerc, Annie; Tiede, Mark
2014-01-01
Purpose: The role of vision in speech representation was investigated in congenitally blind speakers and sighted speakers by studying the correlates of contrastive focus, a prosodic condition in which phonemic contrasts are enhanced. It has been reported that the lips (visible articulators) are less involved in implementing the rounding feature…
ERIC Educational Resources Information Center
Vanrell, Maria del Mar; Mascaro, Ignasi; Torres-Tamarit, Francesc; Prieto, Pilar
2013-01-01
Recent studies in the field of intonational phonology have shown that information-seeking questions can be distinguished from confirmation-seeking questions by prosodic means in a variety of languages (Armstrong, 2010, for Puerto Rican Spanish; Grice & Savino, 1997, for Bari Italian; Kugler, 2003, for Leipzig German; Mata & Santos, 2010, for…
ERIC Educational Resources Information Center
Techentin, Cheryl; Voyer, Daniel; Klein, Raymond M.
2009-01-01
The present study investigated the influence of within- and between-ear congruency on interference and laterality effects in an auditory semantic/prosodic conflict task. Participants were presented dichotically with words (e.g., mad, sad, glad) pronounced in either congruent or incongruent emotional tones (e.g., angry, happy, or sad) and…
ERIC Educational Resources Information Center
Hwang, Hyekyung; Steinhauer, Karsten
2011-01-01
In spoken language comprehension, syntactic parsing decisions interact with prosodic phrasing, which is directly affected by phrase length. Here we used ERPs to examine whether a similar effect holds for the on-line processing of written sentences during silent reading, as suggested by theories of "implicit prosody." Ambiguous Korean sentence…
The Role of Prosody and Explicit Instruction in Processing Instruction
ERIC Educational Resources Information Center
Henry, Nick; Jackson, Carrie N.; Dimidio, Jack
2017-01-01
This study investigates the role of prosodic cues and explicit information (EI) in the acquisition of German accusative case markers. We compared 4 groups of 3rd-semester learners (low intermediate level) who completed 1 of 4 Processing Instruction (PI) treatments that manipulated the presence or absence of EI and focused prosody. The results…
Prosody as a Tool for Assessing Reading Fluency of Adult ESL Students
ERIC Educational Resources Information Center
Sinambela, Seftirina Evina
2017-01-01
Prosodic features in reading-aloud assignments have been associated with students' decoding skill. The goal of the present study is to determine the reliability of prosody for assessing reading fluency of adult ESL students in the Indonesian context. The participants were all Indonesian natives, undergraduate students, adult females and males who…
ERIC Educational Resources Information Center
Krauss, Michael, Ed.
Nine papers on Yupik Eskimo prosody systems are presented. An introductory section gives background information on the Yupik language and dialects, defines prosody, and provides notes on orthography. The papers include: "A History of the Study of Yupik Prosody" (Michael Krauss); "Siberian Yupik and Central Yupik Prosody"…
ERIC Educational Resources Information Center
Weathers, Monica D.; Frank, Elaine M.; Spell, Leigh Ann
2002-01-01
Examined African Americans' and Whites' ability to recognize facial expressions and vocal prosody of predominantly white stimuli at three age groups (children, young adults, and adults). Race was a significant factor in interpreting facial expressions and prosodic features. Individuals from specific ethnic groups were most accurate in decoding…
How Salient Are Onomatopoeia in the Early Input? A Prosodic Analysis of Infant-Directed Speech
ERIC Educational Resources Information Center
Laing, Catherine E.; Vihman, Marilyn; Keren-Portnoy, Tamar
2017-01-01
Onomatopoeia are frequently identified amongst infants' earliest words (Menn & Vihman, 2011), yet few authors have considered why this might be, and even fewer have explored this phenomenon empirically. Here we analyze mothers' production of onomatopoeia in infant-directed speech (IDS) to provide an input-based perspective on these forms.…
Articulator Movement Associated with the Development of Prosodic Control in Children
ERIC Educational Resources Information Center
Grigos, Maria I.; Patel, Rupal
2007-01-01
This study explored the relationship between articulator movement and prosody in children at different developmental ages. Jaw, lower lip, and upper lip kinematics were examined in 4-, 7-, and 11-year-old children as they produced the declarative and interrogative forms of utterances "Show Bob a bot" and "Show Pop a pot." Articulator movement…
ERIC Educational Resources Information Center
Reimchen, Melissa; Soderstrom, Melanie
2017-01-01
Maternal questions play a crucial role in early language acquisition by virtue of their special grammatical, prosodic and lexical forms, and their abundance in the input. Infants are able to discriminate questions from other sentence types and produce rising intonations in their own requests. This study examined whether caregiver questions were…
Second Language Prosody and Oral Reading Comprehension in Learners of Brazilian Portuguese
ERIC Educational Resources Information Center
McCune, W. M. Duce, II
2011-01-01
Learning to read can pose a major challenge to students, and much of this challenge is due to the fact that written language is necessarily impoverished when compared to the rich, continuous speech signal. Prosodic elements of language are scarcely represented in written text, and while oral reading prosody has been addressed in the literature…
Effects of Prosodic and Lexical Constraints on Parsing in Young Children (and Adults)
ERIC Educational Resources Information Center
Snedeker, Jesse; Yuan, Sylvia
2008-01-01
Prior studies of ambiguity resolution in young children have found that children rely heavily on lexical information but persistently fail to use referential constraints in online parsing [Trueswell, J.C., Sekerina, I., Hill, N.M., & Logrip, M.L, (1999). The kindergarten-path effect: Studying on-line sentence processing in young children.…
The Use of Segmentation Cues in Second Language Learners of English
ERIC Educational Resources Information Center
Lin, Candise Yue
2013-01-01
This dissertation project examined the influence of language typology on the use of segmentation cues by second language (L2) learners of English. Previous research has shown that native English speakers rely more on sentence context and lexical knowledge than segmental (i.e. phonotactics or acoustic-phonetics) or prosodic cues (e.g., word stress)…
ERIC Educational Resources Information Center
Theodore, Rachel M.; Demuth, Katherine; Shattuck-Hufnagel, Stefanie
2015-01-01
Purpose: Prosodic and articulatory factors influence children's production of inflectional morphemes. For example, plural -"s" is produced more reliably in utterance-final compared to utterance-medial position (i.e., the positional effect), which has been attributed to the increased planning time in utterance-final position. In previous…
Preschool Children's Performance on Profiling Elements of Prosody in Speech-Communication (PEPS-C)
ERIC Educational Resources Information Center
Gibbon, Fiona E.; Smyth, Heather
2013-01-01
Profiling Elements of Prosody in Speech-Communication (PEPS-C) has not been used widely to assess prosodic abilities of preschool children. This study was therefore aimed at investigating typically developing 4-year-olds' performance on PEPS-C. PEPS-C was presented to 30 typically developing 4-year-olds recruited in southern Ireland. Children were…
An EMA/EPG Study of Vowel-to-Vowel Articulation across Velars in Southern British English
ERIC Educational Resources Information Center
Fletcher, Janet
2004-01-01
Recent studies have attested that the extent of transconsonantal vowel-to-vowel coarticulation is at least partly dependent on degree of prosodic accentuation, in languages like English. A further important factor is the mutual compatibility of consonant and vowel gestures associated with the segments in question. In this study two speakers of…
Cross-Linguistic Expression of Contrastive Accent: Clinical Assessment in Spanish and English
ERIC Educational Resources Information Center
Martinez-Castilla, Pastora; Peppe, Sue
2010-01-01
Well-documented Romance-Germanic differences in the use of accent in speech to convey information-structure and focus cause problems for the assessment of prosodic skills in populations with clinical disorders. The strategies for assessing the ability to use lexical and contrastive accent in English and Spanish are reviewed, and studies in the…
Intonational Division of a Speech Flow in the Kazakh Language
ERIC Educational Resources Information Center
Bazarbayeva, Zeynep M.; Zhalalova, Akshay M.; Ormakhanova, Yenlik N.; Ospangaziyeva, Nazgul B.; Karbozova, Bulbul D.
2016-01-01
The purpose of this research is to analyze the speech intonation of the French, Kazakh, English and Russian languages. The study considers intonation component functions (of melodics, duration, and intensity) in poetry and spoken language. It is defined that a set of prosodic means are used in order to convey the intonational specifics of sounding…
Bouchon, Camillia; Floccia, Caroline; Fux, Thibaut; Adda-Decker, Martine; Nazzi, Thierry
2015-07-01
Consonants and vowels differ acoustically and articulatorily, but also functionally: Consonants are more relevant for lexical processing, and vowels for prosodic/syntactic processing. These functional biases could be powerful bootstrapping mechanisms for learning language, but their developmental origin remains unclear. The relative importance of consonants and vowels at the onset of lexical acquisition was assessed in French-learning 5-month-olds by testing sensitivity to minimal phonetic changes in their own name. Infants' reactions to mispronunciations revealed sensitivity to vowel but not consonant changes. Vowels were also more salient (on duration and intensity) but less distinct (on spectrally based measures) than consonants. Lastly, vowel (but not consonant) mispronunciation detection was modulated by acoustic factors, in particular spectrally based distance. These results establish that consonant changes do not affect lexical recognition at 5 months, while vowel changes do; the consonant bias observed later in development does not emerge until after 5 months through additional language exposure. © 2014 John Wiley & Sons Ltd.
The Acquisition of Compound vs. Phrasal Stress: The Role of Prosodic Constituents
ERIC Educational Resources Information Center
Vogel, Irene; Raimy, Eric
2002-01-01
This paper investigates the acquisition of compound vs. phrasal stress ("hot dog" vs. "hot dog") in English. This has previously been shown to be acquired quite late, in contrast to recent research showing that infants both perceive and prefer rhythmic patterns in their own language. Subjects (40 children in four groups the average ages of which…
The Developing Role of Prosody in Novel Word Interpretation
ERIC Educational Resources Information Center
Herold, Debora S.; Nygaard, Lynne C.; Chicos, Kelly A.; Namy, Laura L.
2011-01-01
This study examined whether children use prosodic correlates to word meaning when interpreting novel words. For example, do children infer that a word spoken in a deep, slow, loud voice refers to something larger than a word spoken in a high, fast, quiet voice? Participants were 4- and 5-year-olds who viewed picture pairs that varied along a…
The Role of Prosodic Boundaries in the Resolution of Lexical Embedding in Speech Comprehension
ERIC Educational Resources Information Center
Salverda, Anne Pier; Dahan, Delphine; McQueen, James M.
2003-01-01
Participants' eye movements were monitored as they heard sentences and saw four pictured objects on a computer screen. Participants were instructed to click on the object mentioned in the sentence. There were more transitory fixations to pictures representing monosyllabic words (e.g. "ham") when the first syllable of the target word (e.g.…
The Prosodic Licensing of Coda Consonants in Early Speech: Interactions with Vowel Length
ERIC Educational Resources Information Center
Miles, Kelly; Yuen, Ivan; Cox, Felicity; Demuth, Katherine
2016-01-01
English has a word-minimality requirement that all open-class lexical items must contain at least two moras of structure, forming a bimoraic foot (Hayes, 1995). Thus, a word with either a long vowel, or a short vowel and a coda consonant, satisfies this requirement. This raises the question of when and how young children might learn this…
Phoneme Restoration Methods Reveal Prosodic Influences on Syntactic Parsing: Data from Bulgarian
ERIC Educational Resources Information Center
Stoyneshka-Raleva, Iglika
2013-01-01
This dissertation introduces and evaluates a new methodology for studying aspects of human language processing and the factors to which it is sensitive. It makes use of the phoneme restoration illusion (Warren, 1970). A small portion of a spoken sentence is replaced by a burst of noise. Listeners typically mentally restore the missing phoneme(s),…
ERIC Educational Resources Information Center
Paul, Rhea; Shriberg, Lawrence D.; McSweeny, Jane; Cicchetti, Domenic; Klin, Ami; Volkmar, Fred
2005-01-01
Shriberg "et al." [Shriberg, L. "et al." (2001). "Journal of Speech, Language and Hearing Research, 44," 1097-1115] described prosody-voice features of 30 high functioning speakers with autistic spectrum disorder (ASD) compared to age-matched control speakers. The present study reports additional information on the speakers with ASD, including…
ERIC Educational Resources Information Center
Belanger, Nathalie; Baum, Shari R.; Titone, Debra
2009-01-01
The neural bases of prosody during the production of literal and idiomatic interpretations of literally plausible idioms were investigated. Left- and right-hemisphere-damaged participants and normal controls produced literal and idiomatic versions of idioms ("He hit the books"). All groups modulated duration to distinguish the interpretations. LHD…
ERIC Educational Resources Information Center
Boucher, Victor J.
2006-01-01
Language learning requires a capacity to recall novel series of speech sounds. Research shows that prosodic marks create grouping effects enhancing serial recall. However, any restriction on memory affecting the reproduction of prosody would limit the set of patterns that could be learned and subsequently used in speech. By implication, grouping…
Cross-Linguistic Perception and Learning of Japanese Lexical Prosody by English Listeners
ERIC Educational Resources Information Center
Shport, Irina A.
2011-01-01
The focus of this dissertation is on how language experience shapes perception of a non-native prosodic contrast. In Tokyo Japanese, fundamental frequency (F0) peak and fall are acoustic cues to lexically contrastive pitch patterns, in which a word may be accented on a particular syllable or unaccented (e.g., "tsuru" "a crane", "tsuru" "a vine",…
ERIC Educational Resources Information Center
Taleghani-Nikazm, Carmen
2016-01-01
This paper offers an instructional unit on the response token "achja" in everyday German conversation. The paper first provides a description of "achja" and its distinctive prosodic features based on empirical research in conversation analysis. The goal of the paper is to provide instructors of German with information and…
ERIC Educational Resources Information Center
Martinez-Castilla, Pastora; Peppe, Susan
2008-01-01
This study aimed to find out what intonation features reliably represent the emotions of "liking" as opposed to "disliking" in the Spanish language, with a view to designing a prosody assessment procedure for use with children with speech and language disorders. 18 intonationally different prosodic realisations (tokens) of one word (limon) were…
Prosodic and Lexical Aspects of Maternal Linguistic Input to Late-Talking Toddlers
ERIC Educational Resources Information Center
D'Odorico, Laura; Jacob, Valentina
2006-01-01
Background: Children who have reached the age of 2 years without having acquired a 50-word vocabulary and/or who use no word combinations are referred to in the literature as "Late Talkers". Research has not yet identified the factors that cause slow development of expressive language; in particular, relatively little research has been carried out…
ERIC Educational Resources Information Center
Krahmer, Emiel; Swerts, Marc
2007-01-01
Speakers employ acoustic cues (pitch accents) to indicate that a word is important, but may also use visual cues (beat gestures, head nods, eyebrow movements) for this purpose. Even though these acoustic and visual cues are related, the exact nature of this relationship is far from well understood. We investigate whether producing a visual beat…
ERIC Educational Resources Information Center
Snow, David
1998-01-01
This paper tested a theory of syllable prominence with 11 children (ages 11 to 26 months). The theory proposes that syllable prominence is a product of two orthogonal suprasegmental systems: stress/accent peaks and phrase boundaries. Use of the developed prominence scale found it parsimoniously accounted for observed biases in syllable omissions…
Oirat Tones and Break Indices (O-ToBI): Intonational Structure of the Oirat Language
ERIC Educational Resources Information Center
Indjieva, Elena
2009-01-01
This doctoral dissertation describes intonation patterns in Spoken Oirat (SO) and proposes a model of the intonational structure of Oirat. The proposed prosodic model is represented in the framework of Oirat Tones and Break Indices (O-ToBI), which is based on the design principles of the original English ToBI (Beckman & Ayers 1994; Beckman…
ERIC Educational Resources Information Center
Ulu, Mustafa
2017-01-01
In this study, the effect of fluent reading (speed, reading accuracy percentage, prosodic reading), comprehension (literal comprehension, inferential comprehension) and problem solving strategies on classifying students with high and low problem solving success was researched. The research sample is composed of 279 students at elementary…
Prosodic Cues in Relative Clauses Disambiguation: Bilinguals vs. L2 Learners
ERIC Educational Resources Information Center
Checa-Garcia, Irene
2016-01-01
This study investigates the preferences for attachment of a relative clause (RC) to a complex noun phrase (NP) of the type: NP1 of NP2, in Spanish-English bilinguals and advanced learners of Spanish. Spanish speakers show a moderate preference for attaching the RC to the first NP, while speakers of English prefer the second NP. Subjects were…
Suprasegmental information affects processing of talking faces at birth.
Guellai, Bahia; Mersad, Karima; Streri, Arlette
2015-02-01
From birth, newborns show a preference for faces talking a native language compared to silent faces. The present study addresses two questions that remained unanswered by previous research: (a) Does the familiarity with the language play a role in this process and (b) Are all the linguistic and paralinguistic cues necessary in this case? Experiment 1 extended newborns' preference for native speakers to non-native ones. Given that fetuses and newborns are sensitive to the prosodic characteristics of speech, Experiments 2 and 3 presented faces talking native and nonnative languages with the speech stream being low-pass filtered. Results showed that newborns preferred looking at a person who talked to them even when only the prosodic cues were provided for both languages. Nonetheless, a familiarity preference for the previously talking face is observed in the "normal speech" condition (i.e., Experiment 1) and a novelty preference in the "filtered speech" condition (Experiments 2 and 3). This asymmetry reveals that newborns process these two types of stimuli differently and that they may already be sensitive to a mismatch between the articulatory movements of the face and the corresponding speech sounds. Copyright © 2014 Elsevier Inc. All rights reserved.
Can Intonational Phrase Structure be Primed (like Syntactic Structure)?
Tooley, Kristen M.; Konopka, Agnieszka E.; Watson, Duane G.
2013-01-01
In three experiments, we investigated whether intonational phrase structure can be primed. In all experiments, participants listened to sentences in which the presence and location of intonational phrase boundaries were manipulated such that the recording either included no intonational phrase boundaries, a boundary in a structurally dispreferred location, in a preferred location, or in both locations. In Experiment 1, participants repeated the sentences to test whether they would reproduce the prosodic structure they had just heard. Experiments 2 and 3 used a prime-target paradigm to evaluate whether the intonational phrase structure heard in the prime sentence might influence that of a novel target sentence. Experiment 1 showed that participants did repeat back sentences that they just heard with the original intonational phrase structure, yet Experiments 2 and 3 found that exposure to intonational phrase boundaries on prime trials did not influence how a novel target sentence was prosodically phrased. These results suggest that speakers may retain the intonational phrasing of a sentence, but this effect is not long-lived and does not generalize across unrelated sentences. Furthermore, these findings provide no evidence that intonational phrase structure is formulated during a planning stage that is separate from other sources of linguistic information. PMID:24188467
Analytic study of the Tadoma method: background and preliminary results.
Norton, S J; Schultz, M C; Reed, C M; Braida, L D; Durlach, N I; Rabinowitz, W M; Chomsky, C
1977-09-01
Certain deaf-blind persons have been taught, through the Tadoma method of speechreading, to use vibrotactile cues from the face and neck to understand speech. This paper reports the results of preliminary tests of the speechreading ability of one adult Tadoma user. The tests were of four major types: (1) discrimination of speech stimuli; (2) recognition of words in isolation and in sentences; (3) interpretation of prosodic and syntactic features in sentences; and (4) comprehension of written (Braille) and oral speech. Words in highly contextual environments were much better perceived than were words in low-context environments. Many of the word errors involved phonemic substitutions which shared articulatory features with the target phonemes, with a higher error rate for vowels than consonants. Relative to performance on word-recognition tests, performance on some of the discrimination tests was worse than expected. Perception of sentences appeared to be mildly sensitive to rate of talking and to speaker differences. Results of the tests on perception of prosodic and syntactic features, while inconclusive, indicate that many of the features tested were not used in interpreting sentences. On an English comprehension test, a higher score was obtained for items administered in Braille than through oral presentation.
Song Perception by Professional Singers and Actors: An MEG Study
Rosslau, Ken; Herholz, Sibylle C.; Knief, Arne; Ortmann, Magdalene; Deuster, Dirk; Schmidt, Claus-Michael; Zehnhoff-Dinnesen, Antoinette am; Pantev, Christo; Dobel, Christian
2016-01-01
The cortical correlates of speech and music perception are essentially overlapping, and the specific effects of different types of training on these networks remain unknown. We compared two groups of vocally trained professionals for music and speech, singers and actors, using recited and sung rhyme sequences from German art songs with semantic and/or prosodic/melodic violations (i.e. violations of pitch) of the last word, in order to measure the evoked activation in a magnetoencephalographic (MEG) experiment. MEG data confirmed the existence of intertwined networks for the sung and spoken modality in an early time window after word violation. In essence for this early response, higher activity was measured after melodic/prosodic than semantic violations in predominantly right temporal areas. For singers as well as for actors, modality-specific effects were evident in predominantly left-temporal lateralized activity after semantic expectancy violations in the spoken modality, and right-dominant temporal activity in response to melodic violations in the sung modality. As an indication of a special group-dependent audiation process, higher neuronal activity for singers appeared in a late time window in right temporal and left parietal areas, both after the recited and the sung sequences. PMID:26863437
Acoustic Sources of Accent in Second Language Japanese Speech.
Idemaru, Kaori; Wei, Peipei; Gubbins, Lucy
2018-05-01
This study reports an exploratory analysis of the acoustic characteristics of second language (L2) speech which give rise to the perception of a foreign accent. Japanese speech samples were collected from American English and Mandarin Chinese speakers (n = 16 in each group) studying Japanese. The L2 participants and native speakers (n = 10) provided speech samples modeling after six short sentences. Segmental (vowels and stops) and prosodic features (rhythm, tone, and fluency) were examined. Native Japanese listeners (n = 10) rated the samples with regard to degrees of foreign accent. The analyses predicting accent ratings based on the acoustic measurements indicated that one of the prosodic features in particular, tone (defined as high and low patterns of pitch accent and intonation in this study), plays an important role in robustly predicting accent rating in L2 Japanese across the two first language (L1) backgrounds. These results were consistent with the prediction based on phonological and phonetic comparisons between Japanese and English, as well as Japanese and Mandarin Chinese. The results also revealed L1-specific predictors of perceived accent in Japanese. The findings of this study contribute to the growing literature that examines sources of perceived foreign accent.
Comparing Measures of Voice Quality From Sustained Phonation and Continuous Speech.
Gerratt, Bruce R; Kreiman, Jody; Garellek, Marc
2016-10-01
The question of what type of utterance (a sustained vowel or continuous speech) is best for voice quality analysis has been extensively studied but with equivocal results. This study examines whether previously reported differences derive from the articulatory and prosodic factors occurring in continuous speech versus sustained phonation. Speakers with voice disorders sustained vowels and read sentences. Vowel samples were excerpted from the steadiest portion of each vowel in the sentences. In addition to sustained and excerpted vowels, a 3rd set of stimuli was created by shortening sustained vowel productions to match the duration of vowels excerpted from continuous speech. Acoustic measures were made on the stimuli, and listeners judged the severity of vocal quality deviation. Sustained vowels and those extracted from continuous speech contain essentially the same acoustic and perceptual information about vocal quality deviation. Perceived and/or measured differences between continuous speech and sustained vowels derive largely from voice source variability across segmental and prosodic contexts and not from variations in vocal fold vibration in the quasisteady portion of the vowels. Approaches to voice quality assessment by using continuous speech samples average across utterances and may not adequately quantify the variability they are intended to assess.
Processing Load Imposed by Line Breaks in English Temporal Wh-Questions
Hirotani, Masako; Terry, J. Michael; Sadato, Norihiro
2016-01-01
Prosody plays an important role in online sentence processing both explicitly and implicitly. It has been shown that prosodically packaging together parts of a sentence that are interpreted together facilitates processing of the sentence. This applies not only to explicit prosody but also implicit prosody. The present work hypothesizes that a line break in a written text induces an implicit prosodic break, which, in turn, should result in a processing bias for interpreting English wh-questions. Two experiments—one self-paced reading study and one questionnaire study—are reported. Both supported the “line break” hypothesis mentioned above. The results of the self-paced reading experiment showed that unambiguous wh-questions were read faster when the location of line breaks (or frame breaks) matched the scope of a wh-phrase (main or embedded clause) than when they did not. The questionnaire tested sentences with an ambiguous wh-phrase, one that could attach either to the main or the embedded clause. These sentences were interpreted as attaching to the main clause more often than to the embedded clause when a line break appeared after the main verb, but not when it appeared after the embedded verb. PMID:27774072
Bruce, Carolyn; To, Cinn-Teng; Newton, Caroline
2012-01-01
This study explored whether an unfamiliar non-native accent, differing in both segmental and prosodic features, was more difficult for individuals with aphasia to understand than an unfamiliar native accent, which differed in segmental features only. Comprehension, which was determined by accuracy judgments on true/false sentences, and speed of response were assessed in the following three conditions: a familiar Southern Standard British English (SSBE) accent, an unfamiliar native Grimsby accent, and an unfamiliar non-native Chinese accent. Thirty-four English speaking adults (17 people with and 17 people without aphasia) served as listeners for this study. All listeners made significantly more errors in the unfamiliar non-native accent, although this difficulty was more marked for those with aphasia. While there was no effect of speaker accent on the response times of listeners with aphasia, listeners without aphasia were significantly slower with the unfamiliar non-native accent. The results indicate that non-native accented speech affects comprehension even on simple tasks in ideal listening conditions. The findings suggest that speaker accent, especially accents varying in both segmental and prosodic features, can be a barrier to successful interactions between non-native accented speakers and native listeners, particularly those with aphasia.
Acoustic and perceptual cues for compound-phrasal contrasts in Vietnamese.
Nguyen, Anh-Thu T; Ingram, John C L
2007-09-01
This paper reports two series of experiments that examined the phonetic correlates of lexical stress in Vietnamese compounds in comparison to their phrasal constructions. In the first series of experiments, acoustic and perceptual characteristics of Vietnamese compound words and their phrasal counterparts were investigated on five likely acoustic correlates of stress or prominence (f0 range and contour, duration, intensity and spectral slope, vowel reduction), elicited under two distinct speaking conditions: a "normal speaking" condition and a "maximum contrast" condition which encouraged speakers to employ prosodic strategies for disambiguation. The results suggested that Vietnamese lacks phonetic resources for distinguishing compounds from phrases lexically and that native speakers may employ a phrase-level prosodic disambiguation strategy (juncture marking), when required to do so. However, in a second series of experiments, minimal pairs of bisyllabic coordinative compounds with reversible syllable positions were examined for acoustic evidence of asymmetrical prominence relations. Clear evidence of asymmetric prominences in coordinative compounds was found, supporting independent results obtained from an analysis of reduplicative compounds and tone sandhi in Vietnamese [Nguyen and Ingram, 2006]. A reconciliation of these apparently conflicting findings on word stress in Vietnamese is presented and discussed.
Expression of emotions and physiological changes during teaching
NASA Astrophysics Data System (ADS)
Tobin, Kenneth; King, Donna; Henderson, Senka; Bellocchi, Alberto; Ritchie, Stephen M.
2016-09-01
We investigated the expression of emotions while teaching in relation to a teacher's physiological changes. We used polyvagal theory (PVT) to frame the study of teaching in a teacher education program. Donna, a teacher-researcher, experienced high levels of stress and anxiety prior to beginning to teach and throughout the lesson, and we used her expressed emotions as a focus for this research. We adopted event-oriented inquiry in a study of heart rate, oxygenation of the blood, and expressed emotions. Five events were identified for multilevel analysis in which we used narrative, prosodic analysis, and hermeneutic-phenomenological methods to learn more about the expression of emotions when Donna had: high heart rate (before and while teaching); low blood oxygenation (before and while teaching); and high blood oxygenation (while teaching). What we learned was consistent with the body's monitoring system recognizing social harm and switching to the control of the unmyelinated vagus nerve, thereby shutting down organs and muscles associated with social communication, leading to irregularities in prosody and expression of emotion. In events involving high heart rate and low blood oxygenation the physiological environment was associated with less effective and sometimes confusing patterns in prosody, including intonation, pace of speaking, and pausing. In a low blood oxygenation environment there was evidence of rapid speech and shallow, irregular breathing. In contrast, during an event in which 100% blood oxygenation occurred, prosody was perceived to be conducive to engagement and the teacher expressed positive emotions, such as satisfaction, while teaching. Becoming aware of the purposes of the research and the results we obtained provided the teacher with tools to enact changes to her teaching practice, especially prosody of the voice.
We regard it as a high priority to create tools to allow teachers and students, if and as necessary, to ameliorate excess emotions, and change heart rate, oxygenation levels, and breathing patterns.
ERIC Educational Resources Information Center
Diehl, Joshua John; Paul, Rhea
2013-01-01
Prosody production atypicalities are a feature of autism spectrum disorders (ASDs), but behavioral measures of performance have failed to provide detail on the properties of these deficits. We used acoustic measures of prosody to compare children with ASDs to age-matched groups with learning disabilities and typically developing peers. Overall,…
ERIC Educational Resources Information Center
Frankel, Lois; Brownstein, Beth
2016-01-01
The work described in this report is the second phase of a project to provide easy-to-use tools for authoring and rendering secondary-school algebra-level math expressions in synthesized speech that is useful for students with blindness or low vision. This report describes the development and results of the second feedback study performed for our project,…
Prosodic Stress, Information, and Intelligibility of Speech in Noise
2009-02-28
across periods during which acoustic information has been suppressed. 15. SUBJECT TERMS Robust speech intelligibility Computational model of...Research Fellow at the Department of Computer Science at the University of Southern California). This research involved superimposing acoustic and...presented at an invitational-only session of the Acoustical Society of America’s and European Acoustic Association’s joint meeting in 2008. In summary, the
ERIC Educational Resources Information Center
Fengler, Ineke; Delfau, Pia-Céline; Röder, Brigitte
2018-01-01
It is yet unclear whether congenitally deaf cochlear implant (CD CI) users' visual and multisensory emotion perception is influenced by their history in sign language acquisition. We hypothesized that early-signing CD CI users, relative to late-signing CD CI users and hearing, non-signing controls, show better facial expression recognition and…
German Pitches in English: Production and Perception of Cross-Varietal Differences in L2
ERIC Educational Resources Information Center
Ulbrich, Christiane
2013-01-01
The present study examines the effect of cross-varietal prosodic characteristics of two German varieties, Northern Standard German (NG) and Swiss German (SG), on the production and perception of foreign accent in L2 Belfast English. The analysis of production data revealed differences in the realisation of nuclear pitch accents in L1 German and L2…
ERIC Educational Resources Information Center
Stafford, Lorenzo D.; Brandaro, Nicola
2010-01-01
Recent research has looked at whether the expectancy of an emotion can account for subsequent valence specific laterality effects of prosodic emotion, though no research has examined this effect for facial emotion. In the study here (n = 58), we investigated this issue using two tasks; an emotional face perception task and a novel word task that…
ERIC Educational Resources Information Center
Dekydtspotter, Laurent; Donaldson, Bryan; Edmonds, Amanda C.; Fultz, Audrey Liljestrand; Petrush, Rebecca A.
2008-01-01
This study investigates the manner in which syntax, prosody, and context interact when second- and fourth-semester college-level English-French learners process relative clause (RC) attachment to either the first noun phrase (NP1) or the second noun phrase (NP2) in complex nominal expressions such as "le secretaire du psychologue qui se promene"…
Identifying Deceptive Speech Across Cultures
2016-06-25
34 Interspeech 2016. 2016. G. An, S. I. Levitan, R. Levitan, A. Rosenberg, M. Levine, J. Hirschberg, "Automatically Classifying Self-Rated Personality Scores from...law, no person shall be subject to any penalty for failing to comply with a collection of information if it does not display a currently valid OMB...correlations of deception ability with personality factors (extraversion, conscientiousness). Using acoustic-prosodic features, gender, ethnicity and
A Toddler's Treatment of "Mm" and "Mm Hm" in Talk with a Parent
ERIC Educational Resources Information Center
Filipi, Anna
2007-01-01
The study to be reported in this paper examined the work accomplished by "mm" and "mm hm" in the interactions of a parent and his daughter aged 0;10-2;0. Using the findings of Gardner (2001) for adults, the analysis shows that "mm" accomplished a range of functions based on its sequential placement and prosodic features, whereas "mm hm" was much…
ERIC Educational Resources Information Center
Ploog, Bertram O.; Banerjee, Snigdha; Brooks, Patricia J.
2009-01-01
This study validated a video game paradigm to explore attention to prosodic and linguistic components of spoken sentences in nine moderate-to-low functioning children with autism and impaired verbal skills. Nine typically developing children were also included. The children listened to pre-recorded sentences varying with respect to content (e.g.,…
Leitman, David I; Ziwich, Rachel; Pasternak, Roey; Javitt, Daniel C
2006-08-01
Theory of Mind (ToM) refers to the ability to infer another person's mental state based upon interactional information. ToM deficits have been suggested to underlie crucial aspects of social interaction failure in disorders such as autism and schizophrenia, although the development of paradigms for demonstrating such deficits remains an ongoing area of research. Recent studies have explored the use of sarcasm perception, in which subjects must infer an individual's sincerity or lack thereof, as a 'real-life' index of ToM ability, and as an index of functioning of specific right hemispheric structures. Sarcastic detection ability has not previously been studied in schizophrenia, although patients have been shown to have deficits in ability to decode emotional information from speech ('affective prosody'). Twenty-two schizophrenia patients and 17 control subjects were tested on their ability to detect sarcasm from spoken speech as well as measures of affective prosody and basic pitch perception. Despite normal overall intelligence, patients performed substantially worse than controls in ability to detect sarcasm (d=2.2), showing both decreased sensitivity (A') in detection of sincerity versus sarcasm and an increased bias (B'') toward sincerity. Correlations across groups revealed significant relationships between impairments in sarcasm recognition, affective prosody and basic pitch perception. These findings demonstrate substantial deficits in ability to infer an internal subjective state based upon vocal modulation among subjects with schizophrenia. Deficits were related to, but were significantly more severe than, more general forms of prosodic and sensorial misperception, and are consistent with both right hemispheric and 'bottom-up' theories of the disorder.
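The sensitivity (A') and bias (B'') indices named in the abstract above are nonparametric signal-detection measures computed from a hit rate and a false-alarm rate. The following is a minimal sketch using the commonly cited Grier (1971) formulas; the abstract does not state which variant of the formulas the authors used, and the function names, the example rates, and the choice of which response counts as a "hit" are assumptions made purely for illustration.

```python
def a_prime(hit_rate: float, fa_rate: float) -> float:
    """Nonparametric sensitivity A' (Grier-style formula).

    0.5 corresponds to chance performance, 1.0 to perfect discrimination.
    """
    h, f = hit_rate, fa_rate
    if h == f:
        return 0.5
    if h > f:
        return 0.5 + ((h - f) * (1 + h - f)) / (4 * h * (1 - f))
    # Below-chance case: mirror the formula around 0.5.
    return 0.5 - ((f - h) * (1 + f - h)) / (4 * f * (1 - h))


def b_double_prime(hit_rate: float, fa_rate: float) -> float:
    """Nonparametric response bias B'' (Grier-style formula).

    0 means no bias; the sign indicates the direction of the bias.
    """
    h, f = hit_rate, fa_rate
    spread_h = h * (1 - h)
    spread_f = f * (1 - f)
    denom = spread_h + spread_f
    if denom == 0:
        return 0.0  # Both rates are 0 or 1, so bias is undefined; report 0.
    if h >= f:
        return (spread_h - spread_f) / denom
    return (spread_f - spread_h) / denom


if __name__ == "__main__":
    # Hypothetical rates, not data from the study.
    print(a_prime(0.8, 0.2))
    print(b_double_prime(0.9, 0.2))
```

Which direction of B'' corresponds to a bias "toward sincerity" depends on whether sarcastic or sincere items are treated as the signal, which the abstract does not specify.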
Neural basis of processing threatening voices in a crowded auditory world
Mothes-Lasch, Martin; Becker, Michael P. I.; Miltner, Wolfgang H. R.
2016-01-01
In real world situations, we typically listen to voice prosody against a background crowded with auditory stimuli. Voices and background can both contain behaviorally relevant features and both can be selectively in the focus of attention. Adequate responses to threat-related voices under such conditions require that the brain unmixes reciprocally masked features depending on variable cognitive resources. It is unknown which brain systems instantiate the extraction of behaviorally relevant prosodic features under varying combinations of prosody valence, auditory background complexity and attentional focus. Here, we used event-related functional magnetic resonance imaging to investigate the effects of high background sound complexity and attentional focus on brain activation to angry and neutral prosody in humans. Results show that prosody effects in mid superior temporal cortex were gated by background complexity but not attention, while prosody effects in the amygdala and anterior superior temporal cortex were gated by attention but not background complexity, suggesting distinct emotional prosody processing limitations in different regions. Crucially, if attention was focused on the highly complex background, the differential processing of emotional prosody was prevented in all brain regions, suggesting that in a distracting, complex auditory world even threatening voices may go unnoticed. PMID:26884543
The impact of rate reduction and increased vocal intensity on coarticulation in dysarthria
NASA Astrophysics Data System (ADS)
Tjaden, Kris
2003-04-01
The dysarthrias are a group of speech disorders resulting from impairment to nervous system structures important for the motor execution of speech. Although numerous studies have examined how dysarthria impacts articulatory movements or changes in vocal tract shape, few studies of dysarthria consider that articulatory events and their acoustic consequences overlap or are coarticulated in connected speech. The impact of rate, loudness, and clarity on coarticulatory patterns in dysarthria is also poorly understood, although these prosodic manipulations frequently are employed as therapy strategies to improve intelligibility in dysarthria and also are known to affect coarticulatory patterns for at least some neurologically healthy speakers. The current study examined the effects of slowed rate and increased vocal intensity on anticipatory coarticulation for speakers with dysarthria secondary to Multiple Sclerosis (MS), as inferred from the acoustic signal. Healthy speakers were studied for comparison purposes. Three repetitions of twelve target words embedded in the carrier phrase "It's a ___ again" were produced in habitual, loud, and slow speaking conditions. F2 frequencies and first moment coefficients were used to infer coarticulation. Both group and individual speaker trends will be examined in the data analyses.
Vocal learning, prosody, and basal ganglia: don't underestimate their complexity.
Ravignani, Andrea; Martins, Mauricio; Fitch, W Tecumseh
2014-12-01
Ackermann et al.'s arguments in the target article need sharpening and rethinking at both mechanistic and evolutionary levels. First, the authors' evolutionary arguments are inconsistent with recent evidence concerning nonhuman animal rhythmic abilities. Second, prosodic intonation conveys much more complex linguistic information than mere emotional expression. Finally, human adults' basal ganglia have a considerably wider role in speech modulation than Ackermann et al. surmise.
The ICSI+ Multilingual Sentence Segmentation System
2006-01-01
these steps the ASR output needs to be enriched with information additional to words, such as speaker diarization, sentence segmentation, or story...and the out- of a speaker diarization is considered as well. We first detail extraction of the prosodic features, and then describe the clas- ation...also takes into account the speaker turns that estimated by the diarization system. In addition to the Max- 1) model speaker turn unigrams, trigram
ERIC Educational Resources Information Center
Frankel, Lois; Brownstein, Beth
2016-01-01
The work described in this report is the second phase of a project to provide easy-to-use tools for authoring and rendering secondary-school algebra-level math expressions in synthesized speech that is useful for students with blindness or low vision. This report describes the development and results of the second feedback study performed for our…
ERIC Educational Resources Information Center
Li, Aike; Post, Brechtje
2014-01-01
This study examines the development of speech rhythm in second language (L2) learners of typologically different first languages (L1s) at different levels of proficiency. An empirical investigation of durational variation in L2 English productions by L1 Mandarin learners and L1 German learners compared to native control values in English and the…
Phonation takes precedence over articulation in development as well as evolution of language.
Oller, D Kimbrough
2014-12-01
Early human vocal development is characterized first by emerging control of phonation and later by prosodic and supraglottal articulation. The target article has missed the opportunity to use these facts in the characterization of evolution in language-specific brain mechanisms. Phonation appears to be the initial human-specific brain change for language, and it was presumably a key target of selection in early hominin evolution.
ERIC Educational Resources Information Center
Hirose, Yuki; Mazuka, Reiko
2017-01-01
A noun can be potentially ambiguous as to whether it is a head on its own, or is a modifier of a Noun + Noun compound waiting for its head. This study investigates whether young children can exploit the prosodic information on a modifier constituent preceding the head to facilitate resolution of such ambiguity in Japanese. Evidence from English…
ERIC Educational Resources Information Center
Liang, Jie; van Heuven, Vincent J.
2004-01-01
We present an acoustic study of segmental and prosodic properties of words produced by a female speaker of Chinese with left-hemisphere brain damage. We measured the location of the point vowels /a, e, @?, i, y, o, u/ and determined their separation in the vowel plane, and their perceptual distinctivity. Similarly, the acoustic properties of the…
ERIC Educational Resources Information Center
Blount, Ben G.; Padgug, Elise J.
Features of parental speech to young children were studied in four English-speaking and four Spanish-speaking families. Children ranged in age from 9 to 12 months for the English speakers and from 8 to 22 months for the Spanish speakers. Examination of the utterances led to the identification of 34 prosodic, paralinguistic, and interactional…
Mitchell, Rachel L. C.; Jazdzyk, Agnieszka; Stets, Manuela; Kotz, Sonja A.
2016-01-01
We aimed to progress understanding of prosodic emotion expression by establishing brain regions active when expressing specific emotions, those activated irrespective of the target emotion, and those whose activation intensity varied depending on individual performance. BOLD contrast data were acquired whilst participants spoke non-sense words in happy, angry or neutral tones, or performed jaw-movements. Emotion-specific analyses demonstrated that when expressing angry prosody, activated brain regions included the inferior frontal and superior temporal gyri, the insula, and the basal ganglia. When expressing happy prosody, the activated brain regions also included the superior temporal gyrus, insula, and basal ganglia, with additional activation in the anterior cingulate. Conjunction analysis confirmed that the superior temporal gyrus and basal ganglia were activated regardless of the specific emotion concerned. Nevertheless, disjunctive comparisons between the expression of angry and happy prosody established that anterior cingulate activity was significantly higher for angry prosody than for happy prosody production. Degree of inferior frontal gyrus activity correlated with the ability to express the target emotion through prosody. We conclude that expressing prosodic emotions (vs. neutral intonation) requires generic brain regions involved in comprehending numerous aspects of language, emotion-related processes such as experiencing emotions, and in the time-critical integration of speech information. PMID:27803656
Combining formal and functional approaches to topic structure.
Zellers, Margaret; Post, Brechtje
2012-03-01
Fragmentation between formal and functional approaches to prosodic variation is an ongoing problem in linguistic research. In particular, the frameworks of the Phonetics of Talk-in-Interaction (PTI) and Empirical Phonology (EP) take very different theoretical and methodological approaches to this kind of variation. We argue that it is fruitful to adopt the insights of both PTI's qualitative analysis and EP's quantitative analysis and combine them into a multiple-methods approach. One realm in which it is possible to combine these frameworks is in the analysis of discourse topic structure and the prosodic cues relevant to it. By combining a quantitative and a qualitative approach to discourse topic structure, it is possible to give a better account of the observed variation in prosody, for example in the case of fundamental frequency (F0) peak timing, which can be explained in terms of pitch accent distribution over different topic structure categories. Similarly, local and global patterns in speech rate variation can be better explained and motivated by adopting insights from both PTI and EP in the study of topic structure. Combining PTI and EP can provide better accounts of speech data as well as opening up new avenues of investigation which would not have been possible in either approach alone.
Boucher, Victor J
2006-01-01
Language learning requires a capacity to recall novel series of speech sounds. Research shows that prosodic marks create grouping effects enhancing serial recall. However, any restriction on memory affecting the reproduction of prosody would limit the set of patterns that could be learned and subsequently used in speech. By implication, grouping effects of prosody would also be limited to reproducible patterns. This view of the role of prosody and the contribution of memory processes in the organization of prosodic patterns is examined by evaluating the correspondence between a reported tendency to restrict stress intervals in speech and size limits on stress-grouping effects. French speech is used where stress defines the endpoints of groups. In Experiment 1, 40 speakers recalled novel series of syllables containing stress-groups of varying size. Recall was not enhanced by groupings exceeding four syllables, which corresponded to a restriction on the reproducibility of stress-groups. In Experiment 2, the subjects produced given sentences containing phrases of differing length. The results show a strong tendency to insert stress within phrases that exceed four syllables. Since prosody can arise in the recall of syntactically unstructured lists, the results offer initial support for viewing memory processes as a factor of stress-rhythm organization.
Kujala, T; Kuuluvainen, S; Saalasti, S; Jansson-Verkasalo, E; von Wendt, L; Lepistö, T
2010-09-01
Asperger syndrome, belonging to the autistic spectrum of disorders, involves deficits in social interaction and prosodic use of language but normal development of formal language abilities. Auditory processing involves both hyper- and hypoactive reactivity to acoustic changes. Responses composed of mismatch negativity (MMN) and obligatory components were recorded for five types of deviations in syllables (vowel, vowel duration, consonant, syllable frequency, syllable intensity) with the multi-feature paradigm from 8-12-year old children with Asperger syndrome. Children with Asperger syndrome had larger MMNs for intensity and smaller MMNs for frequency changes than typically developing children, whereas no MMN group differences were found for the other deviant stimuli. Furthermore, children with Asperger syndrome performed more poorly than controls in Comprehension of Instructions subtest of a language test battery. Cortical speech-sound discrimination is aberrant in children with Asperger syndrome. This is evident both as hypersensitive and depressed neural reactions to speech-sound changes, and is associated with features (frequency, intensity) which are relevant for prosodic processing. The multi-feature MMN paradigm, which includes variation and thereby resembles natural speech hearing circumstances, suggests abnormal pattern of speech discrimination in Asperger syndrome, including both hypo- and hypersensitive responses for speech features. 2010 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.
Kaland, Constantijn; Swerts, Marc; Krahmer, Emiel
2013-09-01
The present research investigates what drives the prosodic marking of contrastive information. For example, a typically developing speaker of a Germanic language like Dutch generally refers to a pink car as a "PINK car" (accented words in capitals) when a previously mentioned car was red. The main question addressed in this paper is whether contrastive intonation is produced with respect to the speaker's or (also) the listener's perspective on the preceding discourse. Furthermore, this research investigates the production of contrastive intonation by typically developing speakers and speakers with autism. The latter group is investigated because people with autism are argued to have difficulties accounting for another person's mental state and exhibit difficulties in the production and perception of accentuation and pitch range. To this end, utterances with contrastive intonation are elicited from both groups and analyzed in terms of function and form of prosody using production and perception measures. Contrary to expectations, typically developing speakers and speakers with autism produce functionally similar contrastive intonation as both groups account for both their own and their listener's perspective. However, typically developing speakers use a larger pitch range and are perceived as speaking more dynamically than speakers with autism, suggesting differences in their use of prosodic form.
Basirat, Anahita
2017-01-01
Cochlear implant (CI) users frequently achieve good speech understanding based on phoneme and word recognition. However, there is a significant variability between CI users in processing prosody. The aim of this study was to examine the abilities of an excellent CI user to segment continuous speech using intonational cues. A post-lingually deafened adult CI user and 22 normal hearing (NH) subjects segmented phonemically identical and prosodically different sequences in French such as 'l'affiche' (the poster) versus 'la fiche' (the sheet), both [lafiʃ]. All participants also completed a minimal pair discrimination task. Stimuli were presented in auditory-only and audiovisual presentation modalities. The performance of the CI user in the minimal pair discrimination task was 97% in the auditory-only and 100% in the audiovisual condition. In the segmentation task, contrary to the NH participants, the performance of the CI user did not differ from the chance level. Visual speech did not improve word segmentation. This result suggests that word segmentation based on intonational cues is challenging when using CIs even when phoneme/word recognition is very well rehabilitated. This finding points to the importance of the assessment of CI users' skills in prosody processing and the need for specific interventions focusing on this aspect of speech communication.
Talk this way: the effect of prosodically conveyed semantic information on memory for novel words.
Shintel, Hadas; Anderson, Nathan L; Fenn, Kimberly M
2014-08-01
Speakers modulate their prosody to express not only emotional information but also semantic information (e.g., raising pitch for upward motion). Moreover, this information can help listeners infer meaning. Work investigating the communicative role of prosodically conveyed meaning has focused on reference resolution, and potential mnemonic benefits remain unexplored. We investigated the effect of prosody on memory for the meaning of novel words, even when it conveys superfluous information. Participants heard novel words, produced with congruent or incongruent prosody, and viewed image pairs representing the intended meaning and its antonym (e.g., a small and a large dog). Importantly, an arrow indicated the image representing the intended meaning, resolving the ambiguity. Participants then completed 2 memory tests, either immediately after learning or after a 24-hr delay, on which they chose an image (out of a new image pair) and a definition that best represented the word. On the image test, memory was similar on the immediate test, but incongruent prosody led to greater loss over time. On the definition test, memory was better for congruent prosody at both times. Results suggest that listeners extract semantic information from prosody even when it is redundant and that prosody can enhance memory, beyond its role in comprehension. PsycINFO Database Record (c) 2014 APA, all rights reserved.
González-Fuente, Santiago; Tubau, Susagna; Espinal, M. Teresa; Prieto, Pilar
2015-01-01
Previous research has proposed that languages diverge with respect to how their speakers confirm and contradict negative questions. Taking into account the classification between truth-based and polarity-based languages, this paper is mainly concerned with the expression of REJECT (a semantic operation that signals a contradiction move with respect to the common ground, along Krifka's lines) in two languages belonging to two typologically distinct answering systems, namely Catalan (polarity-based) and Russian (a mixed system using polarity-based, truth-based, and echoic strategies). This investigation has two goals. First, to assess empirically the relevance of prosodic and gestural patterns in the interpretation of confirming and rejecting responses to negative polar questions. Second, to test the claim that in fact speakers resort to strikingly similar universal strategies at the time of expressing rejecting answers to discourse accessible negative assertions and negative polar questions, namely the use of linguistic units that encode REJECT in combination with ASSERT. The results of our investigation support the existence of a universal answering system for rejecting negative polar questions that integrates lexical and syntactic strategies with prosodic and gestural patterns, and instantiate the REJECT and ASSERT operators. We will also discuss the implications these results have for the truth-based vs. polarity-based taxonomy. PMID:26217255
Acquisition of English word stress patterns in early and late bilinguals
NASA Astrophysics Data System (ADS)
Guion, Susan G.
2004-05-01
Given early acquisition of prosodic knowledge as demonstrated by infants' sensitivity to native language accentual patterns, the question of whether learners can acquire new prosodic patterns across the life span arises. Acquisition of English stress by early and late Spanish-English and Korean-English bilinguals was investigated. In a production task, two-syllable nonwords were produced in noun and verb sentence frames. In a perception task, preference for first or last syllable stress on the nonwords was indicated. Also, real words that were phonologically similar to the nonwords were collected. Logistic regression analyses and ANOVAs were conducted to determine the effect of three factors (syllable structure, lexical class, and stress patterns of phonologically similar words) on the production and perception responses. In all three groups, stress patterns of phonologically similar real words predicted stress on nonwords. For the two other factors, early bilinguals patterned similarly to the native-English participants. Late Spanish-English bilinguals demonstrated less learning of stress patterns based on syllabic structure, and late Korean-English bilinguals demonstrated less learning of stress patterns based on lexical class than native-English speakers. Thus, compared to native speakers, late bilinguals' ability to abstract stress patterns is reduced and affected by the first language. [Work supported by NIH.]
Learning word order at birth: A NIRS study.
Benavides-Varela, Silvia; Gervain, Judit
2017-06-01
In language, the relative order of words in sentences carries important grammatical functions. However, the developmental origins and the neural correlates of the ability to track word order are to date poorly understood. The current study therefore investigates the origins of infants' ability to learn about the sequential order of words, using near-infrared spectroscopy (NIRS) with newborn infants. We have conducted two experiments: one in which a word order change was implemented in 4-word sequences recorded with a list intonation (as if each word was a separate item in a list; list prosody condition, Experiment 1) and one in which the same 4-word sequences were recorded with a well-formed utterance-level prosodic contour (utterance prosody condition, Experiment 2). We found that newborns could detect the violation of the word order in the list prosody condition, but not in the utterance prosody condition. These results suggest that while newborns are already sensitive to word order in linguistic sequences, prosody appears to be a stronger cue than word order for the identification of linguistic units at birth. Copyright © 2017. Published by Elsevier Ltd.
Kello, Christopher T; Bella, Simone Dalla; Médé, Butovens; Balasubramaniam, Ramesh
2017-10-01
Humans talk, sing and play music. Some species of birds and whales sing long and complex songs. All these behaviours and sounds exhibit hierarchical structure: syllables and notes are positioned within words and musical phrases, words and motives in sentences and musical phrases, and so on. We developed a new method to measure and compare hierarchical temporal structures in speech, song and music. The method identifies temporal events as peaks in the sound amplitude envelope, and quantifies event clustering across a range of timescales using Allan factor (AF) variance. AF variances were analysed and compared for over 200 different recordings from more than 16 different categories of signals, including recordings of speech in different contexts and languages, musical compositions and performances from different genres. Non-human vocalizations from two bird species and two types of marine mammals were also analysed for comparison. The resulting patterns of AF variance across timescales were distinct to each of four natural categories of complex sound: speech, popular music, classical music and complex animal vocalizations. Comparisons within and across categories indicated that nested clustering in longer timescales was more prominent when prosodic variation was greater, and when sounds came from interactions among individuals, including interactions between speakers, musicians, and even killer whales. Nested clustering also was more prominent for music compared with speech, and reflected beat structure for popular music and self-similarity across timescales for classical music. In summary, hierarchical temporal structures reflect the behavioural and social processes underlying complex vocalizations and musical performances. © 2017 The Author(s).
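The clustering measure named in the abstract above can be illustrated with a minimal sketch. It uses the standard Allan factor definition, AF(T) = E[(N_{k+1} - N_k)^2] / (2 E[N_k]), where N_k is the number of events in the k-th of a series of adjacent windows of length T. The function name, the use of plain event times as input, and the windowing details are assumptions for illustration; the abstract does not describe the authors' implementation.

```python
def allan_factor(event_times, window, total_duration):
    """Allan factor of a point process at one timescale (window length).

    event_times: event onsets in seconds (e.g. amplitude-envelope peaks).
    window: window length T in seconds.
    total_duration: length of the recording in seconds.
    """
    n_windows = int(total_duration // window)
    if n_windows < 2:
        raise ValueError("need at least two adjacent windows")

    # Count events falling in each adjacent, non-overlapping window.
    counts = [0] * n_windows
    for t in event_times:
        k = int(t // window)
        if 0 <= k < n_windows:
            counts[k] += 1

    mean_count = sum(counts) / n_windows
    if mean_count == 0:
        raise ValueError("no events inside the analysed windows")

    # Mean squared difference between counts in neighbouring windows.
    sq_diffs = [(counts[i + 1] - counts[i]) ** 2 for i in range(n_windows - 1)]
    mean_sq_diff = sum(sq_diffs) / len(sq_diffs)

    return mean_sq_diff / (2 * mean_count)


# Evenly spaced events give AF = 0; bursts of events push AF above 1.
periodic = [0.5 + k for k in range(10)]
bursty = [0.1, 0.2, 0.3, 0.4, 4.1, 4.2, 4.3, 4.4]
af_periodic = allan_factor(periodic, window=1.0, total_duration=10.0)
af_bursty = allan_factor(bursty, window=1.0, total_duration=8.0)
```

Evaluating the function over a range of window lengths yields a curve of AF against timescale, which is the kind of profile the abstract compares across sound categories.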
2009-04-08
to changes on input data is quantified. It is also shown in a perceptive evaluation that the presented objective approach of dialect distance...of Arabic dialects are discussed. We also show the repeatability of presented measure, and its correlation with human perception. Conclusions are...in the strict sense of metric spaces. 2. Proposed Method Human perception tests indicate that prosodic cues, including pitch movements
ERIC Educational Resources Information Center
Haskins Labs., New Haven, CT.
This report is one of a regular series about the status and progress of studies on the nature of speech, instrumentation for its investigation, and practical applications. The 17 papers discuss the identification of sine-wave analogues of speech sounds; prosodic information for vowel identity; progressive changes in articulatory patterns in verbal…
Ross, Elliott D; Monnot, Marilee
2011-04-01
The Aprosodia Battery was developed to distinguish different patterns of affective-prosodic deficits in patients with left versus right brain damage by using affective utterances with incrementally reduced verbal-articulatory demands. It has also been used to assess affective-prosodic performance in various clinical groups, including patients with schizophrenia, PTSD, multiple sclerosis, alcohol abuse and Alzheimer disease and in healthy adults, as means to explore maturational-aging effects. To date, all studies using the Aprosodia Battery have yielded statistically robust results. This paper describes an extensive, quantitative error analysis using previous results from the Aprosodia Battery in patients with left and right brain damage, age-equivalent controls (old adults), and a group of young adults. This inductive analysis was performed to address three major issues in the literature: (1) sex and (2) maturational-aging effects in comprehending affective prosody and (3) differential hemispheric lateralization of emotions. We found no overall sex effects for comprehension of affective prosody. There were, however, scattered sex effects related to a particular affect, suggesting that these differences were related to cognitive appraisal rather than primary perception. Results in the brain damaged groups did not support the Valence Hypothesis of emotional lateralization but did support the Right Hemisphere Hypothesis of emotional lateralization. When comparing young versus old adults, a robust maturational-aging effect was observed in overall error rates and in the distribution of errors across affects. This effect appears to be mediated, in part, by cognitive appraisal, causing an alteration in the salience of different affective-prosodic stimuli with increasing age. 
In addition, the maturational-aging effects lend support for the Emotion-Type hypothesis of emotional lateralization and the "classic aging effect" that is due primarily to decline of right hemisphere cognitive functions in senescence. The results of our inductive analysis may help direct future deductive research efforts, exploring the neuropsychology of emotional communication, by taking into account the potentially confounding influence of (1) methodological differences involving construction of test stimuli and assessment procedures, (2) developmental, maturational and aging effects related to cognitive appraisal and (3) whether a stimulus has a primary or social-emotional bias. Published by Elsevier Ltd.
Entropy Based Classifier Combination for Sentence Segmentation
2007-01-01
speaker diarization system to divide the audio data into hypothetical speakers [17...the prosodic feature also includes turn-based features which describe the position of a word in relation to diarization segmentation. The speaker ...robust speaker segmentation: the ICSI-SRI fall 2004 diarization system,” in Proc. RT-04F Workshop, 2004. [18] “The rich transcription fall 2003,” http://nist.gov/speech/tests/rt/rt2003/fall/docs/rt03-fall-eval-plan-v9.pdf.
Loutrari, Ariadne; Lorch, Marjorie Perlman
2017-07-01
We present a follow-up study on the case of a Greek amusic adult, B.Z., whose impaired performance on scale, contour, interval, and meter was reported by Paraskevopoulos, Tsapkini, and Peretz in 2010, employing a culturally-tailored version of the Montreal Battery of Evaluation of Amusia. In the present study, we administered a novel set of perceptual judgement tasks designed to investigate the ability to appreciate holistic prosodic aspects of 'expressiveness' and emotion in phrase length music and speech stimuli. Our results show that, although diagnosed as a congenital amusic, B.Z. scored as well as healthy controls (N=24) on judging 'expressiveness' and emotional prosody in both speech and music stimuli. These findings suggest that the ability to make perceptual judgements about such prosodic qualities may be preserved in individuals who demonstrate difficulties perceiving basic musical features such as melody or rhythm. B.Z.'s case yields new insights into amusia and the processing of speech and music prosody through a holistic approach. The employment of novel stimuli with relatively fewer non-naturalistic manipulations, as developed for this study, may be a useful tool for revealing unexplored aspects of music and speech cognition and offer the possibility to further the investigation of the perception of acoustic streams in more authentic auditory conditions. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Acoustic cues to perception of word stress by English, Mandarin, and Russian speakers.
Chrabaszcz, Anna; Winn, Matthew; Lin, Candise Y; Idsardi, William J
2014-08-01
This study investigated how listeners' native language affects their weighting of acoustic cues (such as vowel quality, pitch, duration, and intensity) in the perception of contrastive word stress. Native speakers (N = 45) of typologically diverse languages (English, Russian, and Mandarin) performed a stress identification task on nonce disyllabic words with fully crossed combinations of each of the 4 cues in both syllables. The results revealed that although the vowel quality cue was the strongest cue for all groups of listeners, pitch was the second strongest cue for the English and the Mandarin listeners but was virtually disregarded by the Russian listeners. Duration and intensity cues were used by the Russian listeners to a significantly greater extent compared with the English and Mandarin participants. Compared with when cues were noncontrastive across syllables, cues were stronger when they were in the iambic contour than when they were in the trochaic contour. Although both English and Russian are stress languages and Mandarin is a tonal language, stress perception performance of the Mandarin listeners but not of the Russian listeners is more similar to that of the native English listeners, both in terms of weighting of the acoustic cues and the cues' relative strength in different word positions. The findings suggest that tuning of second-language prosodic perceptions is not entirely predictable by prosodic similarities across languages.
Acoustic Cues to Perception of Word Stress by English, Mandarin, and Russian Speakers
Chrabaszcz, Anna; Winn, Matthew; Lin, Candise Y.; Idsardi, William J.
2017-01-01
Purpose: This study investigated how listeners’ native language affects their weighting of acoustic cues (such as vowel quality, pitch, duration, and intensity) in the perception of contrastive word stress. Method: Native speakers (N = 45) of typologically diverse languages (English, Russian, and Mandarin) performed a stress identification task on nonce disyllabic words with fully crossed combinations of each of the 4 cues in both syllables. Results: The results revealed that although the vowel quality cue was the strongest cue for all groups of listeners, pitch was the second strongest cue for the English and the Mandarin listeners but was virtually disregarded by the Russian listeners. Duration and intensity cues were used by the Russian listeners to a significantly greater extent compared with the English and Mandarin participants. Compared with when cues were noncontrastive across syllables, cues were stronger when they were in the iambic contour than when they were in the trochaic contour. Conclusions: Although both English and Russian are stress languages and Mandarin is a tonal language, stress perception performance of the Mandarin listeners but not of the Russian listeners is more similar to that of the native English listeners, both in terms of weighting of the acoustic cues and the cues’ relative strength in different word positions. The findings suggest that tuning of second-language prosodic perceptions is not entirely predictable by prosodic similarities across languages. PMID:24686836
Stress priming in reading and the selective modulation of lexical and sub-lexical pathways.
Colombo, Lucia; Zevin, Jason
2009-09-29
Four experiments employed a priming methodology to investigate different mechanisms of stress assignment and how they are modulated by lexical and sub-lexical mechanisms in reading aloud in Italian. Lexical stress is unpredictable in Italian, and requires lexical look-up. The most frequent stress pattern (Dominant) is on the penultimate syllable [laVOro (work)], while stress on the antepenultimate syllable [MAcchina (car)] is relatively less frequent (non-Dominant). Word and pseudoword naming responses primed by words with non-dominant stress--which require whole-word knowledge to be read correctly--were compared to those primed by nonwords. Percentage of errors to words and percentage of dominant stress responses to nonwords were measured. In Experiments 1 and 2 stress errors increased for non-dominant stress words primed by nonwords, as compared to when they were primed by words. The results could be attributed to greater activation of sub-lexical codes, and an associated tendency to assign the dominant stress pattern by default in the nonword prime condition. Alternatively, they may have been the consequence of prosodic priming, inducing more errors on trials in which the stress pattern of primes and targets was not congruent. The two interpretations were investigated in Experiments 3 and 4. The results overall suggested a limited role of the default metrical pattern in word pronunciation, and showed clear effect of prosodic priming, but only when the sub-lexical mechanism prevailed.
Chakraborty, Rahul; Goffman, Lisa
2013-01-01
Purpose: To assess the influence of L2 proficiency on production characteristics of rhythmic sequences in L1 (Bengali) and L2 (English), with emphasis on linguistic transfer. One goal was to examine, using kinematic evidence, how L2 proficiency influences the production of iambic and trochaic words, focusing on temporal and spatial aspects of prosody. A second goal was to assess whether prosodic structure influences judgment of foreign accent. Method: Twenty Bengali-English bilinguals, 10 with low proficiency and 10 with high proficiency in English, and 10 monolingual English speakers participated. Lip and jaw movements were recorded while the bilinguals produced Bengali and English words embedded in sentences. Lower lip movement amplitude and duration were measured in trochaic and iambic words. Six native English listeners judged the nativeness of the bilingual speakers. Results: Evidence of L1 to L2 transfer was observed through duration but not amplitude cues. More proficient L2 speakers varied duration to mark iambic stress. Perceptually, the high proficiency group received relatively higher native-like accent ratings. Trochees were judged as more native than iambs. Interpretation: Even in the face of L1-L2 lexical stress transfer, non-native speakers demonstrated knowledge of prosodic contrasts. Movement duration appears to be more amenable to modifications than amplitude. PMID:21106699
Bone, Daniel; Li, Ming; Black, Matthew P.; Narayanan, Shrikanth S.
2013-01-01
Segmental and suprasegmental speech signal modulations offer information about paralinguistic content such as affect, age and gender, pathology, and speaker state. Speaker state encompasses medium-term, temporary physiological phenomena influenced by internal or external biochemical actions (e.g., sleepiness, alcohol intoxication). Perceptual and computational research indicates that detecting speaker state from speech is a challenging task. In this paper, we present a system constructed with multiple representations of prosodic and spectral features that provided the best result at the Intoxication Subchallenge of Interspeech 2011 on the Alcohol Language Corpus. We discuss the details of each classifier and show that fusion improves performance. We additionally address the question of how best to construct a speaker state detection system in terms of robust and practical marginalization of associated variability such as through modeling speakers, utterance type, gender, and utterance length. As is the case in human perception, speaker normalization provides significant improvements to our system. We show that a held-out set of baseline (sober) data can be used to achieve comparable gains to other speaker normalization techniques. Our fused frame-level statistic-functional systems, fused GMM systems, and final combined system achieve unweighted average recalls (UARs) of 69.7%, 65.1%, and 68.8%, respectively, on the test set. More consistent numbers compared to development set results occur with matched-prompt training, where the UARs are 70.4%, 66.2%, and 71.4%, respectively. The combined system improves over the Challenge baseline by 5.5% absolute (8.4% relative), also improving upon our previously best result. PMID:24376305
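The metric reported in the abstract above, unweighted average recall (UAR), is the plain mean of the per-class recalls, so each class counts equally regardless of how many samples it has. The following is a minimal sketch of the metric only; the label names are hypothetical and nothing here reproduces the authors' classifiers or data.

```python
def unweighted_average_recall(y_true, y_pred):
    """Mean of per-class recalls; every class has equal weight."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")

    recalls = []
    for cls in sorted(set(y_true)):
        # Recall for this class: correctly predicted / all true members.
        total = sum(1 for t in y_true if t == cls)
        hits = sum(1 for t, p in zip(y_true, y_pred) if t == cls and p == cls)
        recalls.append(hits / total)
    return sum(recalls) / len(recalls)


# Hypothetical imbalanced test set: a classifier that always answers
# "sober" reaches 80% accuracy but only chance-level UAR.
y_true = ["sober"] * 8 + ["intoxicated"] * 2
y_pred = ["sober"] * 10
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
uar = unweighted_average_recall(y_true, y_pred)
```

This is why UAR is commonly preferred over accuracy when the class distribution is skewed, as it often is in speaker state corpora.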
Infant Directed Speech Enhances Statistical Learning in Newborn Infants: An ERP Study
Teinonen, Tuomas; Tervaniemi, Mari; Huotilainen, Minna
2016-01-01
Statistical learning and the social contexts of language addressed to infants are hypothesized to play important roles in early language development. Previous behavioral work has found that the exaggerated prosodic contours of infant-directed speech (IDS) facilitate statistical learning in 8-month-old infants. Here we examined the neural processes involved in on-line statistical learning and investigated whether the use of IDS facilitates statistical learning in sleeping newborns. Event-related potentials (ERPs) were recorded while newborns were exposed to 12 pseudo-words, six spoken with exaggerated pitch contours of IDS and six spoken without exaggerated pitch contours (ADS) in ten alternating blocks. We examined whether ERP amplitudes for syllable position within a pseudo-word (word-initial vs. word-medial vs. word-final, indicating statistical word learning) and speech register (ADS vs. IDS) would interact. The ADS and IDS registers elicited similar ERP patterns for syllable position in an early 0–100 ms component but elicited different ERP effects in both the polarity and topographical distribution at 200–400 ms and 450–650 ms. These results provide the first evidence that the exaggerated pitch contours of IDS result in differences in brain activity linked to on-line statistical learning in sleeping newborns. PMID:27617967
Surface Management System Departure Event Data Analysis
NASA Technical Reports Server (NTRS)
Monroe, Gilena A.
2010-01-01
This paper presents a data analysis of the Surface Management System (SMS) performance of departure events, including push-back and runway departure events. The paper focuses on the detection performance, or the ability to detect departure events, as well as the prediction performance of SMS. The results detail a modest overall detection performance of push-back events and a significantly high overall detection performance of runway departure events. The overall detection performance of SMS for push-back events is approximately 55%. The overall detection performance of SMS for runway departure events nears 100%. This paper also presents the overall SMS prediction performance for runway departure events as well as the timeliness of the Aircraft Situation Display for Industry data source for SMS predictions.
ERP evidence for the recognition of emotional prosody through simulated cochlear implant strategies.
Agrawal, Deepashri; Timm, Lydia; Viola, Filipa Campos; Debener, Stefan; Büchner, Andreas; Dengler, Reinhard; Wittfoth, Matthias
2012-09-20
Emotionally salient information in spoken language can be provided by variations in speech melody (prosody) or by emotional semantics. Emotional prosody is essential to convey feelings through speech. In sensori-neural hearing loss, impaired speech perception can be improved by cochlear implants (CIs). The aim of this study was to investigate the performance of normal-hearing (NH) participants on the perception of emotional prosody with vocoded stimuli. Semantically neutral sentences with emotional (happy, angry and neutral) prosody were used. Sentences were manipulated to simulate two CI speech-coding strategies: the Advanced Combination Encoder (ACE) and the newly developed Psychoacoustic Advanced Combination Encoder (PACE). Twenty NH adults were asked to recognize emotional prosody from ACE and PACE simulations. Performance was assessed using behavioral tests and event-related potentials (ERPs). Behavioral data revealed superior performance with original stimuli compared to the simulations. For simulations, better recognition for happy and angry prosody was observed compared to the neutral. Irrespective of simulated or unsimulated stimulus type, a significantly larger P200 event-related potential was observed for happy prosody after sentence onset than the other two emotions. Further, the amplitude of P200 was significantly more positive for PACE strategy use compared to the ACE strategy. Results suggested P200 peak as an indicator of active differentiation and recognition of emotional prosody. Larger P200 peak amplitude for happy prosody indicated importance of fundamental frequency (F0) cues in prosody processing. Advantage of PACE over ACE highlighted a privileged role of the psychoacoustic masking model in improving prosody perception. Taken together, the study emphasizes the importance of vocoded simulation to better understand the prosodic cues which CI users may be utilizing.
A model of human event detection in multiple process monitoring situations
NASA Technical Reports Server (NTRS)
Greenstein, J. S.; Rouse, W. B.
1978-01-01
It is proposed that human decision making in many multi-task situations might be modeled in terms of the manner in which the human detects events related to his tasks and the manner in which he allocates his attention among his tasks once he feels events have occurred. A model of human event detection performance in such a situation is presented. An assumption of the model is that, in attempting to detect events, the human generates the probability that events have occurred. Discriminant analysis is used to model the human's generation of these probabilities. An experimental study of human event detection performance in a multiple process monitoring situation is described and the application of the event detection model to this situation is addressed. The experimental study employed a situation in which subjects simultaneously monitored several dynamic processes for the occurrence of events and made yes/no decisions on the presence of events in each process. Input to the event detection model of the information displayed to the experimental subjects allows comparison of the model's performance with the performance of the subjects.
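As a rough illustration of how discriminant analysis can produce an event probability, the sketch below fits a two-class linear discriminant on a single feature and converts the discriminant score into a posterior probability that an event has occurred. Everything specific here is an assumption for illustration: the single scalar feature, the equal-variance Gaussian classes, and the function names. The abstract does not state which features or which form of discriminant analysis the authors used.

```python
import math
from statistics import mean


def fit_lda_1d(no_event, event):
    """Fit a two-class, one-feature linear discriminant.

    Returns the class means, the pooled variance and the prior
    probability of the 'event' class. Assumes both classes have
    at least two samples in total beyond their means and that the
    pooled variance is non-zero.
    """
    mu0, mu1 = mean(no_event), mean(event)
    n0, n1 = len(no_event), len(event)
    pooled_var = (
        sum((x - mu0) ** 2 for x in no_event)
        + sum((x - mu1) ** 2 for x in event)
    ) / (n0 + n1 - 2)
    prior_event = n1 / (n0 + n1)
    return mu0, mu1, pooled_var, prior_event


def event_probability(x, params):
    """Posterior probability of 'event' given feature value x."""
    mu0, mu1, var, prior_event = params
    # With equal-variance Gaussian classes the log-odds are linear in x.
    log_odds = math.log(prior_event / (1 - prior_event)) + (
        (mu1 - mu0) / var
    ) * (x - (mu0 + mu1) / 2)
    return 1 / (1 + math.exp(-log_odds))


# Hypothetical training values of a displayed-signal feature.
params = fit_lda_1d(no_event=[0, 1, 2], event=[4, 5, 6])
p_mid = event_probability(3, params)   # halfway between the class means
p_high = event_probability(5, params)
p_low = event_probability(1, params)
```

A yes/no detection decision, as made by the subjects in the study, could then be modeled by thresholding this probability.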
Straatman, L V; Rietveld, A C M; Beijen, J; Mylanus, E A M; Mens, L H M
2010-10-01
Cochlear implants are largely unable to encode voice pitch information, which hampers the perception of some prosodic cues, such as intonation. This study investigated whether children with a cochlear implant in one ear were better able to detect differences in intonation when a hearing aid was added in the other ear ("bimodal fitting"). Fourteen children with normal hearing and 19 children with bimodal fitting participated in two experiments. The first experiment assessed the just noticeable difference in F0, by presenting listeners with a naturally produced bisyllabic utterance with an artificially manipulated pitch accent. The second experiment assessed the ability to distinguish between questions and affirmations in Dutch words, again by using artificial manipulation of F0. For the implanted group, performance significantly improved in each experiment when the hearing aid was added. However, even with a hearing aid, the implanted group required exaggerated F0 excursions to perceive a pitch accent and to identify a question. These exaggerated excursions are close to the maximum excursions typically used by Dutch speakers. Nevertheless, the results of this study showed that compared to the implant only condition, bimodal fitting improved the perception of intonation.
NASA Astrophysics Data System (ADS)
Yuki, Akiyama; Satoshi, Ueyama; Ryosuke, Shibasaki; Adachi, Ryuichiro
2016-06-01
In this study, we developed a method to detect sudden population concentration on a certain day and area, that is, an "Event," all over Japan in 2012 using mass GPS data provided from mobile phone users. First, stay locations of all phone users were detected using existing methods. Second, areas and days where Events occurred were detected by aggregation of mass stay locations into 1-km-square grid polygons. Finally, the proposed method could detect Events with an especially large number of visitors in the year by removing the influences of Events that occurred continuously throughout the year. In addition, we demonstrated reasonable reliability of the proposed Event detection method by comparing the results of Event detection with light intensities obtained from the DMSP/OLS night light images. Our method can detect not only positive events such as festivals but also negative events such as natural disasters and road accidents. These results are expected to support policy development of urban planning, disaster prevention, and transportation management.
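The aggregation step described above can be sketched as follows: stay locations are binned into grid cells per day, and a day is flagged when a cell's count stands well above that cell's usual level. The cell size in degrees (only roughly 1 km), the median baseline, and the `ratio` and `min_count` thresholds are all assumptions chosen for illustration; the abstract does not give the authors' actual criteria.

```python
import math
from collections import defaultdict
from statistics import median


def grid_cell(lat, lon, cell_deg=0.01):
    """Map a coordinate to a grid cell index (0.01 deg is roughly 1 km)."""
    return (math.floor(lat / cell_deg), math.floor(lon / cell_deg))


def detect_events(stays, ratio=3.0, min_count=5):
    """Flag (cell, day) pairs with unusually many stays.

    stays: iterable of (day, lat, lon) tuples, one per detected stay.
    A day is flagged when its count is at least min_count and exceeds
    ratio times the cell's median daily count. Days with no stays are
    not part of the baseline, which is a simplification.
    """
    counts = defaultdict(lambda: defaultdict(int))
    for day, lat, lon in stays:
        counts[grid_cell(lat, lon)][day] += 1

    events = []
    for cell, per_day in counts.items():
        baseline = median(per_day.values())
        for day, n in per_day.items():
            if n >= min_count and n > ratio * baseline:
                events.append((cell, day, n))
    return sorted(events)


# Hypothetical data: two stays per day in one cell, plus a crowd on day 5.
stays = [(day, 35.0005, 139.0005) for day in range(1, 11) for _ in range(2)]
stays += [(5, 35.0005, 139.0005)] * 10
events = detect_events(stays)
```

Using a per-cell baseline is one simple way to discount places that are busy all year, which is the effect the abstract attributes to its final filtering step.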
Multi-Station Broad Regional Event Detection Using Waveform Correlation
NASA Astrophysics Data System (ADS)
Slinkard, M.; Stephen, H.; Young, C. J.; Eckert, R.; Schaff, D. P.; Richards, P. G.
2013-12-01
Previous waveform correlation studies have established the occurrence of repeating seismic events in various regions, and the utility of waveform-correlation event-detection on broad regional or even global scales to find events currently not included in traditionally-prepared bulletins. The computational burden, however, is high, limiting previous experiments to relatively modest template libraries and/or processing time periods. We have developed a distributed computing waveform correlation event detection utility that allows us to process years of continuous waveform data with template libraries numbering in the thousands. We have used this system to process several years of waveform data from IRIS stations in East Asia, using libraries of template events taken from global and regional bulletins. Detections at a given station are confirmed by 1) comparison with independent bulletins of seismicity, and 2) consistent detections at other stations. We find that many of the detected events are not in traditional catalogs, hence the multi-station comparison is essential. In addition to detecting the similar events, we also estimate magnitudes very precisely based on comparison with the template events (when magnitudes are available). We have investigated magnitude variation within detected families of similar events, false alarm rates, and the temporal and spatial reach of templates.
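The core operation behind waveform-correlation detection is a sliding normalized cross-correlation between a template event and continuous data, with a detection declared wherever the coefficient exceeds a threshold. The sketch below shows that operation for a single channel in pure Python. The threshold value and function names are assumptions for illustration, and it makes no attempt at the distributed processing or multi-station confirmation described in the abstract.

```python
import math


def sliding_correlation(template, trace):
    """Normalized cross-correlation of template against every window of trace."""
    m = len(template)
    t_mean = sum(template) / m
    t_centered = [x - t_mean for x in template]
    t_norm = math.sqrt(sum(x * x for x in t_centered))

    scores = []
    for start in range(len(trace) - m + 1):
        window = trace[start:start + m]
        w_mean = sum(window) / m
        w_centered = [x - w_mean for x in window]
        w_norm = math.sqrt(sum(x * x for x in w_centered))
        if t_norm == 0 or w_norm == 0:
            # Flat template or flat window: correlation is undefined, use 0.
            scores.append(0.0)
        else:
            dot = sum(a * b for a, b in zip(t_centered, w_centered))
            scores.append(dot / (t_norm * w_norm))
    return scores


def detect(template, trace, threshold=0.9):
    """Sample offsets where the correlation coefficient reaches the threshold."""
    scores = sliding_correlation(template, trace)
    return [i for i, s in enumerate(scores) if s >= threshold]


# Hypothetical data: a scaled copy of the template buried at offset 10.
template = [0, 1, 0, -1, 0]
trace = [0] * 10 + [0, 2, 0, -2, 0] + [0] * 10
scores = sliding_correlation(template, trace)
detections = detect(template, trace)
```

Because the coefficient is normalized, a repeat of the template at a different amplitude still correlates near 1, which is what lets a template match similar events of different size.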
Network hydraulics inclusion in water quality event detection using multiple sensor stations data.
Oliker, Nurit; Ostfeld, Avi
2015-09-01
Event detection is one of the most challenging current topics in water distribution systems analysis: how regular on-line hydraulic (e.g., pressure, flow) and water quality (e.g., pH, residual chlorine, turbidity) measurements at different network locations can be efficiently utilized to detect water quality contamination events. This study describes an integrated event detection model which combines multiple sensor stations data with network hydraulics. To date, event detection modelling is likely limited to a single sensor station location and dataset. Single sensor station models are detached from network hydraulics insights and as a result might be significantly exposed to false positive alarms. This work is aimed at decreasing this limitation through integrating local and spatial hydraulic data understanding into an event detection model. The spatial analysis complements the local event detection effort through discovering events with lower signatures by exploring the sensors' mutual hydraulic influences. The unique contribution of this study is in incorporating hydraulic simulation information into the overall event detection process of spatially distributed sensors. The methodology is demonstrated on two example applications using base runs and sensitivity analyses. Results show a clear advantage of the suggested model over single-sensor event detection schemes. Copyright © 2015 Elsevier Ltd. All rights reserved.
A cross-linguistic fMRI study of perception of intonation and emotion in Chinese.
Gandour, Jack; Wong, Donald; Dzemidzic, Mario; Lowe, Mark; Tong, Yunxia; Li, Xiaojian
2003-03-01
Conflicting data from neurobehavioral studies of the perception of intonation (linguistic) and emotion (affective) in spoken language highlight the need to further examine how functional attributes of prosodic stimuli are related to hemispheric differences in processing capacity. Because of similarities in their acoustic profiles, intonation and emotion permit us to assess to what extent hemispheric lateralization of speech prosody depends on functional instead of acoustical properties. To examine how the brain processes linguistic and affective prosody, an fMRI study was conducted using Chinese, a tone language in which both intonation and emotion may be signaled prosodically, in addition to lexical tones. Ten Chinese and 10 English subjects were asked to perform discrimination judgments of intonation (I: statement, question) and emotion (E: happy, angry, sad) presented in semantically neutral Chinese sentences. A baseline task required passive listening to the same speech stimuli (S). In direct between-group comparisons, the Chinese group showed left-sided frontoparietal activation for both intonation (I vs. S) and emotion (E vs. S) relative to baseline. When comparing intonation relative to emotion (I vs. E), the Chinese group demonstrated prefrontal activation bilaterally; parietal activation in the left hemisphere only. The reverse comparison (E vs. I), on the other hand, revealed that activation occurred in anterior and posterior prefrontal regions of the right hemisphere only. These findings show that some aspects of perceptual processing of emotion are dissociable from intonation, and, moreover, that they are mediated by the right hemisphere. Copyright 2003 Wiley-Liss, Inc.
NASA Astrophysics Data System (ADS)
Gao, Pei-pei; Liu, Feng
2016-10-01
With the development of information technology and artificial intelligence, speech synthesis plays a significant role in the field of human-computer interaction. However, the main problem of current speech synthesis techniques is a lack of naturalness and expressiveness, so that synthesized speech is not yet close to the standard of natural language. Another problem is that human-computer interaction based on speech synthesis is too monotonous to support a mechanism driven by the user. This article introduces the historical development of speech synthesis and summarizes the general process of the technique. It is pointed out that the prosody generation module is an important part of the speech synthesis process. On the basis of further research, using eye-activity patterns during reading to control and drive prosody generation is introduced as a new human-computer interaction method that enriches the forms of synthesis. The present state of speech synthesis technology is reviewed in detail. Based on eye gaze data extraction, and using the eye movement signal as a real-time driver, a speech synthesis method that can express the real speech rhythm of the speaker is proposed. That is, while the reader silently reads the corpus text, reading information such as the eye gaze duration per prosodic unit is captured, and a hierarchical prosodic duration model is established to determine the duration parameters of the synthesized speech. Finally, the feasibility of the method is verified by analysis.
Mass counts: ERP correlates of non-adjacent dependency learning under different exposure conditions.
Citron, Francesca M M; Oberecker, Regine; Friederici, Angela D; Mueller, Jutta L
2011-01-10
Miniature language learning can serve to model real language learning as high proficiency can be reached after very little exposure. In a previous study by Mueller et al. [18] German participants acquired non-adjacent syntactic dependencies by mere exposure to correct Italian sentences, but their ERP pattern differed from the one shown by native speakers. The present study follows up on that experiment using a similar design and material and is focused on two important issues: the influence of acoustic cues in the material and the impact of the learning procedure. With respect to the latter we compared alternating learning and test phases to a continuous learning and test phase. In addition, a splicing procedure eliminated prosodic cues in order to ensure that non-adjacent dependencies were learned instead of adjacent ones. Results for the continuous phase design showed a native-like biphasic ERP pattern, an N400 followed by a left-focused positivity. In the alternating design behavioural accuracy was lower and only an N400 was found. The results suggest an advantage of continuous learning phases for adult learners, possibly due to the absence of ungrammatical items present in the test phases in the alternating learning procedure. Furthermore, the replication of the earlier study with prosodically controlled material adds evidence to the general finding that syntactic non-adjacent dependencies can be learned from mere exposure to correct examples. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Automatic intelligibility classification of sentence-level pathological speech
Kim, Jangwon; Kumar, Naveen; Tsiartas, Andreas; Li, Ming; Narayanan, Shrikanth S.
2014-01-01
Pathological speech usually refers to the condition of speech distortion resulting from atypicalities in voice and/or in the articulatory mechanisms owing to disease, illness or other physical or biological insult to the production system. Although automatic evaluation of speech intelligibility and quality could come in handy in these scenarios to assist experts in diagnosis and treatment design, the many sources and types of variability often make it a very challenging computational processing problem. In this work we propose novel sentence-level features to capture abnormal variation in the prosodic, voice quality and pronunciation aspects in pathological speech. In addition, we propose a post-classification posterior smoothing scheme which refines the posterior of a test sample based on the posteriors of other test samples. Finally, we perform feature-level fusions and subsystem decision fusion for arriving at a final intelligibility decision. The performances are tested on two pathological speech datasets, the NKI CCRT Speech Corpus (advanced head and neck cancer) and the TORGO database (cerebral palsy or amyotrophic lateral sclerosis), by evaluating classification accuracy without overlapping subjects’ data among training and test partitions. Results show that the feature sets of each of the voice quality subsystem, prosodic subsystem, and pronunciation subsystem, offer significant discriminating power for binary intelligibility classification. We observe that the proposed posterior smoothing in the acoustic space can further reduce classification errors. The smoothed posterior score fusion of subsystems shows the best classification performance (73.5% for unweighted, and 72.8% for weighted, average recalls of the binary classes). PMID:25414544
Real-Time Event Detection for Monitoring Natural and Source ...
The use of event detection systems in finished drinking water systems is increasing in order to monitor water quality in both operational and security contexts. Recent incidents involving harmful algal blooms and chemical spills into watersheds have increased interest in monitoring source water quality prior to treatment. This work highlights the use of the CANARY event detection software in detecting suspected illicit events in an actively monitored watershed in South Carolina. CANARY is an open source event detection software that was developed by USEPA and Sandia National Laboratories. The software works with any type of sensor, utilizes multiple detection algorithms and approaches, and can incorporate operational information as needed. Monitoring has been underway for several years to detect events related to intentional or unintentional dumping of materials into the monitored watershed. This work evaluates the feasibility of using CANARY to enhance the detection of events in this watershed. This presentation will describe the real-time monitoring approach used in this watershed, the selection of CANARY configuration parameters that optimize detection for this watershed and monitoring application, and the performance of CANARY during the time frame analyzed. Further, this work will highlight how rainfall events impacted analysis, and the innovative application of CANARY taken in order to effectively detect the suspected illicit events. This presentation d
Distributed Events in Sentinel: Design and Implementation of a Global Event Detector
1999-01-01
local event detector and a global event detector to detect events. Global event detector in this case plays the role of a message sending/receiving than...significant in this case. The system performance will decrease with increase in the number of applications involved in global event detection. Yet from a...Figure 8: A Global event tree (2) 1. Global composite event is detected at the GED In this case, the whole global composite event tree is sent to the
Self-similarity Clustering Event Detection Based on Triggers Guidance
NASA Astrophysics Data System (ADS)
Zhang, Xianfei; Li, Bicheng; Tian, Yuxuan
Traditional Event Detection and Characterization (EDC) methods treat event detection as a classification problem. They use words as the samples for training a classifier, which can lead to an imbalance between positive and negative samples. This approach also suffers from data sparseness when the corpus is small. Rather than classifying events with words as samples, this paper clusters events to determine event types. Guided by event triggers, it uses self-similarity to converge on the value of K in the K-means algorithm and optimizes the clustering algorithm. Then, combining named entities with their relative position information, the new method further pinpoints the event type. The new method avoids the dependence on event templates found in traditional methods, and its event detection results can be used in automatic text summarization, text retrieval, and topic detection and tracking.
[Study on the timeliness of detection and reporting on public health emergency events in China].
Li, Ke-Li; Feng, Zi-Jian; Ni, Da-Xin
2009-03-01
To analyze the timeliness of detection and reporting of public health emergency events, and to explore effective strategies for improving capacity in these areas. We conducted a retrospective survey of 3275 emergency events reported through the Public Health Emergency Events Surveillance System from 2005 to the first half of 2006. A uniform self-administered questionnaire, developed by county Centers for Disease Control and Prevention, was used to collect data, including information on the detection and reporting of the events. For communicable disease events, the median time interval between the occurrence of the first case and the detection of the event was 6 days (P25 = 2, P75 = 13). For food poisoning events and clusters of disease of unknown origin, the medians were 3 hours (P25, P75 = 16) and 1 day (P25 = 0, P75 = 5), respectively. 71.54% of the events were reported by the discoverers within 2 hours of detection. In general, the ranges of the time intervals between occurrence, detection, and reporting of the events differed by category of event. The timeliness of detection and reporting could have been improved dramatically if the definitions of events had been more reasonable and accessible with respect to their characteristics, and if training programs for healthcare staff and teachers had been improved.
Automatic event recognition and anomaly detection with attribute grammar by learning scene semantics
NASA Astrophysics Data System (ADS)
Qi, Lin; Yao, Zhenyu; Li, Li; Dong, Junyu
2007-11-01
In this paper we present a novel framework for automatic event recognition and abnormal behavior detection with attribute grammar by learning scene semantics. This framework combines learning scene semantics by trajectory analysis and constructing attribute grammar-based event representation. The scene and event information is learned automatically. Abnormal behaviors that disobey scene semantics or event grammar rules are detected. By this method, an approach to understanding video scenes is achieved. Furthermore, with this prior knowledge, the accuracy of abnormal event detection is increased.
Subsurface event detection and classification using Wireless Signal Networks.
Yoon, Suk-Un; Ghazanfari, Ehsan; Cheng, Liang; Pamukcu, Sibel; Suleiman, Muhannad T
2012-11-05
Subsurface environment sensing and monitoring applications such as detection of water intrusion or a landslide, which could significantly change the physical properties of the host soil, can be accomplished using a novel concept, Wireless Signal Networks (WSiNs). The wireless signal networks take advantage of the variations of radio signal strength on the distributed underground sensor nodes of WSiNs to monitor and characterize the sensed area. To characterize subsurface environments for event detection and classification, this paper provides a detailed list and experimental data of soil properties on how radio propagation is affected by soil properties in subsurface communication environments. Experiments demonstrated that calibrated wireless signal strength variations can be used as indicators to sense changes in the subsurface environment. The concept of WSiNs for the subsurface event detection is evaluated with applications such as detection of water intrusion, relative density change, and relative motion using actual underground sensor nodes. To classify geo-events using the measured signal strength as a main indicator of geo-events, we propose a window-based minimum distance classifier based on Bayesian decision theory. The window-based classifier for wireless signal networks has two steps: event detection and event classification. With the event detection, the window-based classifier classifies geo-events on the event occurring regions that are called a classification window. The proposed window-based classification method is evaluated with a water leakage experiment in which the data has been measured in laboratory experiments. In these experiments, the proposed detection and classification method based on wireless signal network can detect and classify subsurface events.
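The two-step window-based scheme described in this abstract (detect an event window, then classify it by minimum distance) can be illustrated with a minimal sketch. This is not the authors' implementation: the baseline level, the threshold, the class labels, and the prototype signal strengths below are all hypothetical, and the window statistic is assumed to be a simple mean. Under equal priors and equal-variance Gaussian class models, choosing the nearest prototype coincides with the Bayes decision, which is the sense in which the sketch is "minimum distance".

```python
from statistics import mean


def detect_windows(signal, baseline, window, threshold):
    """Step 1: return start indices of non-overlapping windows whose mean
    signal strength deviates from the baseline by more than the threshold."""
    hits = []
    for start in range(0, len(signal) - window + 1, window):
        chunk = signal[start:start + window]
        if abs(mean(chunk) - baseline) > threshold:
            hits.append(start)
    return hits


def classify_window(chunk, prototypes):
    """Step 2: assign the window to the class whose prototype mean is nearest."""
    level = mean(chunk)
    return min(prototypes, key=lambda label: abs(level - prototypes[label]))


def detect_and_classify(signal, baseline, window, threshold, prototypes):
    """Run detection, then classify only the detected (classification) windows."""
    return [
        (start, classify_window(signal[start:start + window], prototypes))
        for start in detect_windows(signal, baseline, window, threshold)
    ]


# Hypothetical received signal strengths (dBm) and class prototypes.
readings = [-60] * 4 + [-72] * 4 + [-60] * 4 + [-66] * 4
classes = {"water_intrusion": -72, "density_change": -66}
events = detect_and_classify(readings, baseline=-60, window=4,
                             threshold=3, prototypes=classes)
```

Classifying only the windows flagged in step 1 keeps quiet periods out of the classifier, which is the role the abstract gives to the classification window.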
Subsurface Event Detection and Classification Using Wireless Signal Networks
Yoon, Suk-Un; Ghazanfari, Ehsan; Cheng, Liang; Pamukcu, Sibel; Suleiman, Muhannad T.
2012-01-01
Subsurface environment sensing and monitoring applications such as detection of water intrusion or a landslide, which could significantly change the physical properties of the host soil, can be accomplished using a novel concept, Wireless Signal Networks (WSiNs). The wireless signal networks take advantage of the variations of radio signal strength on the distributed underground sensor nodes of WSiNs to monitor and characterize the sensed area. To characterize subsurface environments for event detection and classification, this paper provides a detailed list and experimental data of soil properties on how radio propagation is affected by soil properties in subsurface communication environments. Experiments demonstrated that calibrated wireless signal strength variations can be used as indicators to sense changes in the subsurface environment. The concept of WSiNs for the subsurface event detection is evaluated with applications such as detection of water intrusion, relative density change, and relative motion using actual underground sensor nodes. To classify geo-events using the measured signal strength as a main indicator of geo-events, we propose a window-based minimum distance classifier based on Bayesian decision theory. The window-based classifier for wireless signal networks has two steps: event detection and event classification. With the event detection, the window-based classifier classifies geo-events on the event occurring regions that are called a classification window. The proposed window-based classification method is evaluated with a water leakage experiment in which the data has been measured in laboratory experiments. In these experiments, the proposed detection and classification method based on wireless signal network can detect and classify subsurface events. PMID:23202191
Nishii, Nobuhiro; Miyoshi, Akihito; Kubo, Motoki; Miyamoto, Masakazu; Morimoto, Yoshimasa; Kawada, Satoshi; Nakagawa, Koji; Watanabe, Atsuyuki; Nakamura, Kazufumi; Morita, Hiroshi; Ito, Hiroshi
2018-03-01
Remote monitoring (RM) has been advocated as the new standard of care for patients with cardiovascular implantable electronic devices (CIEDs). RM has allowed the early detection of adverse clinical events, such as arrhythmia, lead failure, and battery depletion. However, lead failure was often identified only by arrhythmic events, but not impedance abnormalities. To compare the usefulness of arrhythmic events with conventional impedance abnormalities for identifying lead failure in CIED patients followed by RM. CIED patients in 12 hospitals have been followed by the RM center in Okayama University Hospital. All transmitted data have been analyzed and summarized. From April 2009 to March 2016, 1,873 patients have been followed by the RM center. During the mean follow-up period of 775 days, 42 lead failure events (atrial lead 22, right ventricular pacemaker lead 5, implantable cardioverter defibrillator [ICD] lead 15) were detected. The proportion of lead failures detected only by arrhythmic events, which were not detected by conventional impedance abnormalities, was significantly higher than that detected by impedance abnormalities (arrhythmic event 76.2%, 95% CI: 60.5-87.9%; impedance abnormalities 23.8%, 95% CI: 12.1-39.5%). Twenty-seven events (64.7%) were detected without any alert. Of 15 patients with ICD lead failure, none has experienced inappropriate therapy. RM can detect lead failure earlier, before clinical adverse events. However, CIEDs often diagnose lead failure as just arrhythmic events without any warning. Thus, to detect lead failure earlier, careful human analysis of arrhythmic events is useful. © 2017 Wiley Periodicals, Inc.
Full-waveform detection of non-impulsive seismic events based on time-reversal methods
NASA Astrophysics Data System (ADS)
Solano, Ericka Alinne; Hjörleifsdóttir, Vala; Liu, Qinya
2017-12-01
We present a full-waveform detection method for non-impulsive seismic events, based on time-reversal principles. We use the strain Green's tensor as a matched filter, correlating it with continuous observed seismograms, to detect non-impulsive seismic events. We show that this is mathematically equivalent to an adjoint method for detecting earthquakes. We define the detection function, a scalar valued function, which depends on the stacked correlations for a group of stations. Event detections are given by the times at which the amplitude of the detection function exceeds a given value relative to the noise level. The method can make use of the whole seismic waveform or any combination of time-windows with different filters. It is expected to have an advantage compared to traditional detection methods for events that do not produce energetic and impulsive P waves, for example glacial events, landslides, volcanic events and transform-fault earthquakes, provided that the velocity structure along the path is relatively well known. Furthermore, the method has advantages over empirical Green's function template-matching methods, as it does not depend on records from previously detected events, and therefore is not limited to events occurring in similar regions and with similar focal mechanisms as these events. The method is not specific to any particular way of calculating the synthetic seismograms, and therefore complicated structural models can be used. This is particularly beneficial for intermediate size events that are registered on regional networks, for which the effect of lateral structure on the waveforms can be significant. To demonstrate the feasibility of the method, we apply it to two different areas located along the mid-oceanic ridge system west of Mexico where non-impulsive events have been reported.
The first study area is between Clipperton and Siqueiros transform faults (9°N), during the time of two earthquake swarms, occurring in March 2012 and May 2016. The second area of interest is the Gulf of California where two swarms took place during July and September of 2015. We show that we are able to detect previously non-reported, non-impulsive events and recommend that this method be used together with more traditional template matching methods to maximize the number of detected events.
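The detection function described in this abstract (station correlations stacked into one scalar series, then thresholded relative to the noise level) can be sketched in a few lines. This is a simplified illustration, not the authors' method: it assumes each station's synthetic template is already aligned to a common trial origin time, it uses the median absolute value of the stack as the noise estimate, and all waveforms and the threshold factor are made up.

```python
from statistics import median


def sliding_correlation(record, template):
    """Correlate the template with the record at every admissible lag."""
    m = len(template)
    return [
        sum(record[lag + i] * template[i] for i in range(m))
        for lag in range(len(record) - m + 1)
    ]


def detection_function(records, templates):
    """Stack the per-station correlations into one scalar series."""
    per_station = [sliding_correlation(r, t) for r, t in zip(records, templates)]
    n = min(len(c) for c in per_station)
    return [sum(c[lag] for c in per_station) for lag in range(n)]


def detect(stacked, factor):
    """Return lags where the stack exceeds `factor` times the noise level
    (noise level assumed here to be the median absolute value)."""
    noise = median(abs(v) for v in stacked)
    return [lag for lag, v in enumerate(stacked) if abs(v) > factor * noise]


# Hypothetical data: two stations, a constant background, one buried pulse.
template = [1.0, 2.0, 1.0]
record = [0.1] * 20
record[5:8] = [1.1, 2.1, 1.1]  # background plus the template at lag 5
stack = detection_function([record, record], [template, template])
detections = detect(stack, factor=12.0)
```

Stacking across stations raises the peak relative to uncorrelated noise, which is why the threshold is applied to the stack and not to each station separately.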
Contribution of Infrasound to IDC Reviewed Event Bulletin
NASA Astrophysics Data System (ADS)
Bittner, Paulina; Polich, Paul; Gore, Jane; Ali, Sherif Mohamed; Medinskaya, Tatiana; Mialle, Pierrick
2016-04-01
Until 2003 two waveform technologies, i.e. seismic and hydroacoustic, were used to detect and locate events included in the International Data Centre (IDC) Reviewed Event Bulletin (REB). The first atmospheric event was published in the REB in 2003 but infrasound detections could not be used by the Global Association (GA) Software due to the unmanageably high number of spurious associations. Offline improvements of the automatic processing took place to reduce the number of false detections to a reasonable level. In February 2010 the infrasound technology was reintroduced to the IDC operations and has contributed to both automatic and reviewed IDC bulletins. The primary contribution of infrasound technology is to detect atmospheric events. These events may also be observed at seismic stations, which will significantly improve event location. Examples of REB events that were detected by the International Monitoring System (IMS) infrasound network were fireballs (e.g. Bangkok fireball, 2015), volcanic eruptions (e.g. Calbuco, Chile 2015) and large surface explosions (e.g. Tianjin, China 2015). Quarry blasts and large earthquakes belong to events primarily recorded at seismic stations of the IMS network but often detected at the infrasound stations. The presence of an infrasound detection associated with an event from a mining area indicates a surface explosion. Satellite imaging and a database of active mines can be used to confirm the origin of such events. This presentation will summarize the contribution of 6 years of infrasound data to IDC bulletins and provide examples of events recorded at the IMS infrasound network. Results of this study may help to improve location of small events with observations on infrasound stations.
The effectiveness of pretreatment physics plan review for detecting errors in radiation therapy.
Gopan, Olga; Zeng, Jing; Novak, Avrey; Nyflot, Matthew; Ford, Eric
2016-09-01
The pretreatment physics plan review is a standard tool for ensuring treatment quality. Studies have shown that the majority of errors in radiation oncology originate in treatment planning, which underscores the importance of the pretreatment physics plan review. This quality assurance measure is fundamentally important and central to the safety of patients and the quality of care that they receive. However, little is known about its effectiveness. The purpose of this study was to analyze reported incidents to quantify the effectiveness of the pretreatment physics plan review with the goal of improving it. This study analyzed 522 potentially severe or critical near-miss events within an institutional incident learning system collected over a three-year period. Of these 522 events, 356 originated at a workflow point that was prior to the pretreatment physics plan review. The remaining 166 events originated after the pretreatment physics plan review and were not considered in the study. The applicable 356 events were classified into one of the three categories: (1) events detected by the pretreatment physics plan review, (2) events not detected but "potentially detectable" by the physics review, and (3) events "not detectable" by the physics review. Potentially detectable events were further classified by which specific checks performed during the pretreatment physics plan review detected or could have detected the event. For these events, the associated specific check was also evaluated as to the possibility of automating that check given current data structures. For comparison, a similar analysis was carried out on 81 events from the international SAFRON radiation oncology incident learning system. Of the 356 applicable events from the institutional database, 180/356 (51%) were detected or could have been detected by the pretreatment physics plan review. 
Of these events, 125 actually passed through the physics review; however, only 38% (47/125) were actually detected at the review. Of the 81 events from the SAFRON database, 66/81 (81%) were potentially detectable by the pretreatment physics plan review. From the institutional database, three specific physics checks were particularly effective at detecting events (combined effectiveness of 38%): verifying the isocenter (39/180), verifying DRRs (17/180), and verifying that the plan matched the prescription (12/180). The most effective checks from the SAFRON database were verifying that the plan matched the prescription (13/66) and verifying the field parameters in the record and verify system against those in the plan (23/66). Software-based plan checking systems, if available, would have potential effectiveness of 29% and 64% at detecting events from the institutional and SAFRON databases, respectively. Pretreatment physics plan review is a key safety measure and can detect a high percentage of errors. However, the majority of errors that potentially could have been detected were not detected in this study, indicating the need to improve the pretreatment physics plan review performance. Suggestions for improvement include the automation of specific physics checks performed during the pretreatment physics plan review and the standardization of the review process.
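The percentages quoted in this abstract follow directly from the reported event counts. The short check below recomputes them; the counts are taken from the abstract, while the helper name `pct` and the rounding to whole percentages are choices made here for illustration.

```python
def pct(numerator, denominator):
    """Percentage rounded to the nearest whole number."""
    return round(100 * numerator / denominator)


total_events = 522
before_review = 356
after_review = 166
assert before_review + after_review == total_events

detected_or_detectable = pct(180, before_review)   # 51
caught_at_review = pct(47, 125)                    # 38
safron_detectable = pct(66, 81)                    # 81
top_three_checks = pct(39 + 17 + 12, 180)          # 38
```

The last figure shows that the 38% combined effectiveness of the three named checks is measured against the 180 detected or potentially detectable events, not against all 356 applicable events.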
Giannakidou, Anastasia; Etxeberria, Urtzi
2018-01-01
This paper reviews a series of experimental studies that address what we call “interface judgment,” which is the complex judgment involving integration from multiple levels of grammatical representation such as the syntax-semantics and prosody-semantics interface. We first discuss the results from the ERP literature connected to NPI licensing in different languages, paying particular attention to the N400 and the P600 as neural correlates of this specific phenomenon and focusing on the study by Xiang et al. (2016). The results of this study show evidence that there are two distinct NPI licensing mechanisms, i.e., licensing and rescuing, in line with Giannakidou (1998, 2006). Then we discuss an acceptability judgment task on Greek NPIs which supports the negativity as a scale hypothesis (Zwarts, 1995, 1996; Giannakidou, 1998). For the semantics-prosody interface judgment, we discuss two types of findings on two different phenomena and languages: (i) the study by Giannakidou and Yoon (2016) on scalar and non-scalar NPIs in Greek and Korean, which serves as the foundation for Chatzikonstantinou's (2016) study of production data showing distinct prosodic properties in emphatic (scalar) and non-emphatic (non-scalar) Greek NPIs; (ii) a (production and perception) study by Etxeberria and Irurtzun (2015) on the prosodic disambiguation of the scalar/non-scalar readings of sentences containing the focus particle “ere” in Basque. The main conclusion of the paper is that experimental methods of the kind discussed in the paper are useful in establishing physical, quantitative correlates of interface judgment. PMID:29515470
The origins of language and the evolution of music: A comparative perspective
NASA Astrophysics Data System (ADS)
Masataka, Nobuo
2009-03-01
According to Darwin [Darwin, CR. The descent of man, and selection in relation to sex. London: John Murray; 1871], the human musical faculty ‘must be ranked amongst the most mysterious with which he is endowed’. Music is a human cultural universal that serves no obvious adaptive purpose, making its evolution a puzzle for evolutionary biologists. This review examines Darwin's hypothesis of similarities between language and music indicating a shared evolutionary history. In particular, the fact that both are human universals, have phrase structure, and entail learning and cultural transmission, suggests that any theory of the evolution of language will have implications for the evolution of music, and vice versa. The argument starts by describing variable predispositional musical capabilities and the ontogeny of prosodic communication in human infants and young children, presenting comparative data regarding communication systems commonly present in living nonhuman primate species. Like language, the human music faculty is based on a suite of abilities, some of which are shared with other primates and some of which appear to be uniquely human. Each of these subcomponents may have a different evolutionary history, and should be discussed separately. After briefly considering possible functions of human music for language acquisition, the review ends by discussing the phylogenetic history of music. It concludes that many strands of evidence support Darwin's hypothesis of an intermediate stage of human evolutionary history, characterized by a communication system that resembled music more closely than language, but was identical to neither. This pre-linguistic system, which could probably be referred to as “prosodic protolanguage”, provided a precursor for both modern language and music.
Hardy, Chris J D; Agustus, Jennifer L; Marshall, Charles R; Clark, Camilla N; Russell, Lucy L; Bond, Rebecca L; Brotherhood, Emilie V; Thomas, David L; Crutch, Sebastian J; Rohrer, Jonathan D; Warren, Jason D
2017-07-27
Non-verbal auditory impairment is increasingly recognised in the primary progressive aphasias (PPAs) but its relationship to speech processing and brain substrates has not been defined. Here we addressed these issues in patients representing the non-fluent variant (nfvPPA) and semantic variant (svPPA) syndromes of PPA. We studied 19 patients with PPA in relation to 19 healthy older individuals. We manipulated three key auditory parameters in sequences of spoken syllables: temporal regularity, phonemic spectral structure and prosodic predictability (an index of fundamental information content, or entropy). The ability of participants to process these parameters was assessed using two-alternative, forced-choice tasks and neuroanatomical associations of task performance were assessed using voxel-based morphometry of patients' brain magnetic resonance images. Relative to healthy controls, both the nfvPPA and svPPA groups had impaired processing of phonemic spectral structure and signal predictability while the nfvPPA group additionally had impaired processing of temporal regularity in speech signals. Task performance correlated with standard disease severity and neurolinguistic measures. Across the patient cohort, performance on the temporal regularity task was associated with grey matter in the left supplementary motor area and right caudate, performance on the phoneme processing task was associated with grey matter in the left supramarginal gyrus, and performance on the prosodic predictability task was associated with grey matter in the right putamen. Our findings suggest that PPA syndromes may be underpinned by more generic deficits of auditory signal analysis, with a distributed cortico-subcortical neuroanatomical substrate extending beyond the canonical language network. This has implications for syndrome classification and biomarker development.
Wölwer, Wolfgang; Frommann, Nicole
2011-09-01
In the last decade, several social cognitive remediation programs have been developed for use in schizophrenia. Though existing evidence indicates that such programs can improve social cognition, which is essential for successful social functioning, it remains unclear whether the improvements generalize to social cognitive domains not primarily addressed by the intervention and whether the improved test performance transfers into everyday social functioning. The present study investigated whether, beyond its known effects on facial affect recognition, the Training of Affect Recognition (TAR) has effects on prosodic affect recognition, theory of mind (ToM) performance, social competence in a role-play task, and more general social and occupational functioning. Thirty-eight inpatients with a diagnosis of schizophrenia or schizoaffective disorder were randomly assigned to 6 weeks of treatment with the TAR (primarily targeted at facial affect recognition) or Cognitive Remediation Training (CRT; primarily targeted at neurocognition). Intention-to-treat analyses found significantly larger pre-post improvements with TAR than with CRT in prosodic affect recognition, ToM, and social competence and a trend effect in global social functioning. However, the effects on ToM and social competence were no longer significant in the smaller group of patients who completed treatment according to protocol. Results suggest that TAR effects generalize to other social cognitive domains not primarily addressed. TAR may also enhance social skills and social functioning, although this has to be confirmed. Results are discussed with regard to the need to improve functional outcome in schizophrenia against the background of current evidence from other social cognitive remediation approaches.
Embedded security system for multi-modal surveillance in a railway carriage
NASA Astrophysics Data System (ADS)
Zouaoui, Rhalem; Audigier, Romaric; Ambellouis, Sébastien; Capman, François; Benhadda, Hamid; Joudrier, Stéphanie; Sodoyer, David; Lamarque, Thierry
2015-10-01
Public transport security is one of the main priorities of the public authorities when fighting against crime and terrorism. In this context, there is a great demand for autonomous systems able to detect abnormal events such as violent acts aboard passenger cars and intrusions when the train is parked at the depot. To this end, we present an innovative approach which aims at providing efficient automatic event detection by fusing video and audio analytics and reducing the false alarm rate compared to classical stand-alone video detection. The multi-modal system is composed of two microphones and one camera and integrates onboard video and audio analytics and fusion capabilities. On the one hand, for detecting intrusion, the system relies on the fusion of "unusual" audio events detection with intrusion detections from video processing. The audio analysis consists in modeling the normal ambience and detecting deviation from the trained models during testing. This unsupervised approach is based on clustering of automatically extracted segments of acoustic features and statistical Gaussian Mixture Model (GMM) modeling of each cluster. The intrusion detection is based on the three-dimensional (3D) detection and tracking of individuals in the videos. On the other hand, for violent events detection, the system fuses unsupervised and supervised audio algorithms with video event detection. The supervised audio technique detects specific events such as shouts. A GMM is used to catch the formant structure of a shout signal. Video analytics use an original approach for detecting aggressive motion by focusing on erratic motion patterns specific to violent events. As data with violent events is not easily available, a normality model with structured motions from non-violent videos is learned for one-class classification. A fusion algorithm based on Dempster-Shafer's theory analyses the asynchronous detection outputs and computes the degree of belief of each probable event.
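The final fusion step in this abstract rests on Dempster-Shafer theory, whose core operation is Dempster's rule of combination. The sketch below shows that rule for two detectors. It is only an illustration under stated assumptions: the two-hypothesis frame, the detector names, and the mass values are hypothetical, and the asynchronous timing of detector outputs that the abstract mentions is not modelled.

```python
from itertools import product


def combine(m1, m2):
    """Dempster's rule of combination for two mass functions.

    Each mass function maps a frozenset of hypotheses to its mass.
    Products landing on an empty intersection count as conflict and are
    normalised away.
    """
    combined = {}
    conflict = 0.0
    for (a, mass_a), (b, mass_b) in product(m1.items(), m2.items()):
        common = a & b
        if common:
            combined[common] = combined.get(common, 0.0) + mass_a * mass_b
        else:
            conflict += mass_a * mass_b
    if conflict >= 1.0:
        raise ValueError("sources are in total conflict")
    return {focal: mass / (1.0 - conflict) for focal, mass in combined.items()}


# Hypothetical frame of discernment and detector outputs.
VIOLENT = frozenset({"violent"})
FRAME = frozenset({"violent", "normal"})

audio_mass = {VIOLENT: 0.6, FRAME: 0.4}   # 0.4 left uncommitted
video_mass = {VIOLENT: 0.7, FRAME: 0.3}
fused = combine(audio_mass, video_mass)
belief_violent = fused[VIOLENT]           # 0.42 + 0.18 + 0.28 = 0.88
```

Mass assigned to the whole frame expresses a detector's ignorance, so two moderately confident detectors that agree yield a fused belief higher than either one alone.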
Tokarchuk, Laurissa; Wang, Xinyue; Poslad, Stefan
2017-01-01
In an age when people are predisposed to report real-world events through their social media accounts, many researchers value the benefits of mining user-generated content from social media. Compared with traditional news media, social media services such as Twitter can provide more complete and timely information about real-world events. However, events are often like a puzzle, and in order to solve the puzzle and understand the event, we must identify all the sub-events or pieces. Existing Twitter event monitoring systems for sub-event detection and summarization typically analyse events based on partial data, as conventional data collection methodologies are unable to collect comprehensive event data. As a result, existing systems are often unable to report sub-events in real time and often miss sub-events or pieces of the broader event puzzle completely. This paper proposes a Sub-event detection by real-TIme Microblog monitoring (STRIM) framework that leverages the temporal feature of an expanded set of news-worthy event content. In order to identify sub-events more comprehensively and accurately, this framework first proposes the use of adaptive microblog crawling. Our adaptive microblog crawler is capable of increasing the coverage of events while minimizing the amount of non-relevant content. We then propose a stream division methodology that can be accomplished in real time so that the temporal features of the expanded event streams can be analysed by a burst detection algorithm. In the final steps of the framework, the content features are extracted from each divided stream and recombined to provide a final summarization of the sub-events. The proposed framework is evaluated against traditional event detection using event recall and event precision metrics. Results show that improving the quality and coverage of event content contributes to better event detection by identifying additional valid sub-events. 
The novel combination of our proposed adaptive crawler and our stream division/recombination technique provides significant gains in event recall (44.44%) and event precision (9.57%). The addition of these sub-events or pieces allows us to get closer to solving the event puzzle. PMID:29107976
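The abstract does not say which burst detection algorithm is applied to the divided streams. As a rough illustration only, a sliding-window threshold on per-interval message counts could look like the sketch below; the function name, the window length, and the factor k are all assumptions.

```python
from statistics import mean, pstdev

def detect_bursts(counts, window, k=3.0):
    """Flag intervals whose count exceeds mean + k * std of the preceding window."""
    bursts = []
    for t in range(window, len(counts)):
        history = counts[t - window:t]
        mu, sigma = mean(history), pstdev(history)
        # Floor sigma at 1.0 so a perfectly flat history does not flag tiny changes.
        if counts[t] > mu + k * max(sigma, 1.0):
            bursts.append(t)
    return bursts

# Hypothetical tweet counts per time interval for one divided stream.
counts = [10, 12, 11, 9, 10, 11, 60, 12, 10, 11]
bursts = detect_bursts(counts, window=5)
print(bursts)
```

Running the same detector on each divided stream separately, and then merging the flagged intervals, is one way the "division/recombination" idea could be exercised.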
2013-04-24
DETECT: A MATLAB Toolbox for Event Detection and Identification in Time Series, with Applications to Artifact Detection in EEG Signals Vernon...datasets in the context of events, which are intervals of time where the properties of the signal change relative to a baseline signal. We have developed...As an illustration, we discuss application of the DETECT toolbox for detecting signal artifacts found in continuous multi-channel EEG recordings and
TERMA Framework for Biomedical Signal Analysis: An Economic-Inspired Approach.
Elgendi, Mohamed
2016-11-02
Biomedical signals contain features that represent physiological events, and each of these events has peaks. The analysis of biomedical signals for monitoring or diagnosing diseases requires the detection of these peaks, making event detection a crucial step in biomedical signal processing. Many researchers have difficulty detecting these peaks to investigate, interpret and analyze their corresponding events. To date, there is no generic framework that captures these events in a robust, efficient and consistent manner. A new method referred to for the first time as two event-related moving averages ("TERMA") involves event-related moving averages and detects events in biomedical signals. The TERMA framework is flexible and universal and consists of six independent LEGO building bricks to achieve high accuracy detection of biomedical events. Results recommend that the window sizes for the two moving averages (W1 and W2) have to follow the inequality (8 × W1) ≥ W2 ≥ (2 × W1). Moreover, TERMA is a simple yet efficient event detector that is suitable for wearable devices, point-of-care devices, fitness trackers and smart watches, compared to more complex machine learning solutions.
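The idea of two event-related moving averages can be sketched as follows. This is an illustrative simplification, not the published six-block framework: a short average (window W1) tracks the event, a long average (window W2) tracks the surrounding cycle, and samples where the short average exceeds the long one form candidate blocks. The function names, the `beta` offset, and the rule that a block must span at least W1 samples are assumptions, and odd window lengths are assumed.

```python
def moving_average(x, w):
    """Centered moving average with window w (the window shrinks at the edges)."""
    half = w // 2
    out = []
    for i in range(len(x)):
        lo, hi = max(0, i - half), min(len(x), i + half + 1)
        out.append(sum(x[lo:hi]) / (hi - lo))
    return out

def terma_peaks(signal, w1, w2, beta=0.0):
    """Return indices of peaks found inside blocks where MA(w1) > MA(w2) + offset."""
    if not (2 * w1 <= w2 <= 8 * w1):
        raise ValueError("window sizes should satisfy 2*W1 <= W2 <= 8*W1")
    ma_event = moving_average(signal, w1)
    ma_cycle = moving_average(signal, w2)
    offset = beta * sum(signal) / len(signal)
    above = [e > c + offset for e, c in zip(ma_event, ma_cycle)]
    peaks, start = [], None
    # The trailing False closes a block that runs to the end of the signal.
    for i, flag in enumerate(above + [False]):
        if flag and start is None:
            start = i
        elif not flag and start is not None:
            if i - start >= w1:  # discard blocks narrower than the event window
                peaks.append(max(range(start, i), key=lambda k: signal[k]))
            start = None
    return peaks

# Synthetic signal: two triangular bumps centred on samples 15 and 45.
signal = [0.0] * 60
for center in (15, 45):
    for d in range(-3, 4):
        signal[center + d] = 4.0 - abs(d)

peaks = terma_peaks(signal, w1=3, w2=9)
print(peaks)
```

The explicit check on the window sizes mirrors the inequality recommended in the abstract; everything else about the thresholding is a design choice of this sketch.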
A Probabilistic Approach to Network Event Formation from Pre-Processed Waveform Data
NASA Astrophysics Data System (ADS)
Kohl, B. C.; Given, J.
2017-12-01
The current state of the art for seismic event detection still largely depends on signal detection at individual sensor stations, including picking accurate arrival times and correctly identifying phases, and relying on fusion algorithms to associate individual signal detections to form event hypotheses. But increasing computational capability has enabled progress toward the objective of fully utilizing body-wave recordings in an integrated manner to detect events without the necessity of previously recorded ground truth events. In 2011-2012 Leidos (then SAIC) operated a seismic network to monitor activity associated with geothermal field operations in western Nevada. We developed a new association approach for detecting and quantifying events by probabilistically combining pre-processed waveform data to deal with noisy data and clutter at local distance ranges. The ProbDet algorithm maps continuous waveform data into continuous conditional probability traces using a source model (e.g. Brune earthquake or Mueller-Murphy explosion) to map frequency content and an attenuation model to map amplitudes. Event detection and classification is accomplished by combining the conditional probabilities from the entire network using a Bayesian formulation. This approach was successful in producing a high-Pd, low-Pfa automated bulletin for a local network, and preliminary tests with regional and teleseismic data show that it has promise for global seismic and nuclear monitoring applications. 
The approach highlights several features that we believe are essential to achieving low-threshold automated event detection: it minimizes the utilization of individual seismic phase detections (in traditional techniques, errors in signal detection, timing, feature measurement and initial phase ID compound and propagate into errors in event formation); it has a formalized framework that utilizes information from non-detecting stations; it has a formalized framework that utilizes source information, in particular the spectral characteristics of events of interest; it is entirely model-based, i.e. does not rely on a priori's (particularly important for nuclear monitoring); and it does not rely on individualized signal detection thresholds (it's the network solution that matters).
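The abstract does not give the Bayesian formulation used by ProbDet. Purely as an illustration of how per-station conditional probabilities might be combined at the network level, the sketch below assumes the stations are conditionally independent and applies Bayes' rule in log space; the function name, the likelihood values, and the prior are hypothetical.

```python
import math

def network_posterior(likelihoods, prior):
    """Posterior P(event | all stations) from per-station likelihood pairs.

    likelihoods: list of (P(data_i | event), P(data_i | noise)) per station.
    Assumes conditional independence between stations (a simplification).
    """
    log_event = math.log(prior)
    log_noise = math.log(1.0 - prior)
    for p_event, p_noise in likelihoods:
        log_event += math.log(p_event)
        log_noise += math.log(p_noise)
    # Subtract the larger log term before exponentiating for numerical stability.
    m = max(log_event, log_noise)
    e, n = math.exp(log_event - m), math.exp(log_noise - m)
    return e / (e + n)

# Two stations favour an event; the third (a non-detecting station) favours noise.
stations = [(0.8, 0.2), (0.7, 0.3), (0.4, 0.6)]
posterior = network_posterior(stations, prior=0.01)
print(round(posterior, 4))
```

In this form a non-detecting station still contributes, because its likelihood ratio below 1 pulls the network posterior down rather than being ignored.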
Results from the MACHO Galactic Pixel Lensing Search
NASA Astrophysics Data System (ADS)
Drake, Andrew J.; Minniti, Dante; Alcock, Charles; Allsman, Robyn A.; Alves, David; Axelrod, Tim S.; Becker, Andrew C.; Bennett, David; Cook, Kem H.; Freeman, Ken C.; Griest, Kim; Lehner, Matt; Marshall, Stuart; Peterson, Bruce; Pratt, Mark; Quinn, Peter; Rodgers, Alex; Stubbs, Chris; Sutherland, Will; Tomaney, Austin; Vandehei, Thor; Welch, Doug L.
The MACHO, EROS, OGLE and AGAPE collaborations have been studying the nature of the galactic halo for a number of years using microlensing events. The MACHO group undertakes observations of the LMC, SMC and Galactic Bulge, monitoring the light curves of millions of stars to detect microlensing. Most of these fields are crowded to the extent that all the monitored stars are blended. Such crowding makes accurate photometry difficult. We apply the new technique of Difference Image Analysis (DIA) to archival data to improve the photometry and increase both the detection sensitivity and the effective search area. The application of this technique also allows us to detect so-called 'pixel lensing' events. These are microlensing events where the source star is only detectable during lensing. The detection of these events will allow us to make a large increase in the number of detected microlensing events. We present a light curve demonstrating the detection of a pixel lensing event with this technique.
NASA Technical Reports Server (NTRS)
Totman, Peter D. (Inventor); Everton, Randy L. (Inventor); Egget, Mark R. (Inventor); Macon, David J. (Inventor)
2007-01-01
A method and apparatus for detecting and determining event characteristics such as, for example, the material failure of a component, in a manner which significantly reduces the amount of data collected. A sensor array, including a plurality of individual sensor elements, is coupled to a programmable logic device (PLD) configured to operate in a passive state and an active state. A triggering event is established such that the PLD records information only upon detection of the occurrence of the triggering event which causes a change in state within one or more of the plurality of sensor elements. Upon the occurrence of the triggering event, the change in state of the one or more sensor elements causes the PLD to record in memory which sensor element detected the event and at what time the event was detected. The PLD may be coupled with a computer for subsequent downloading and analysis of the acquired data.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chung, Sun-Ju; Lee, Chung-Uk; Koo, Jae-Rim, E-mail: sjchung@kasi.re.kr, E-mail: leecu@kasi.re.kr, E-mail: koojr@kasi.re.kr
2014-04-20
Even though the recently discovered high-magnification event MOA-2010-BLG-311 had complete coverage over its peak, confident planet detection did not happen due to extremely weak central perturbations (EWCPs, fractional deviations of ≲ 2%). For confident detection of planets in EWCP events, it is necessary to have both high cadence monitoring and high photometric accuracy better than those of current follow-up observation systems. The next-generation ground-based observation project, Korea Microlensing Telescope Network (KMTNet), satisfies these conditions. We estimate the probability of occurrence of EWCP events with fractional deviations of ≤2% in high-magnification events and the efficiency of detecting planets in the EWCP events using the KMTNet. From this study, we find that the EWCP events occur with a frequency of >50% in the case of ≲ 100 M_E planets with separations of 0.2 AU ≲ d ≲ 20 AU. We find that for main-sequence and sub-giant source stars, ≳ 1 M_E planets in EWCP events with deviations ≤2% can be detected with frequency >50% in a certain range that changes with the planet mass. However, it is difficult to detect planets in EWCP events of bright stars like giant stars because it is easy for KMTNet to be saturated around the peak of the events because of its constant exposure time. EWCP events are caused by close, intermediate, and wide planetary systems with low-mass planets and close and wide planetary systems with massive planets. Therefore, we expect that a much greater variety of planetary systems than those already detected, which are mostly intermediate planetary systems, regardless of the planet mass, will be significantly detected in the near future.
The effectiveness of pretreatment physics plan review for detecting errors in radiation therapy
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gopan, Olga; Zeng, Jing; Novak, Avrey
Purpose: The pretreatment physics plan review is a standard tool for ensuring treatment quality. Studies have shown that the majority of errors in radiation oncology originate in treatment planning, which underscores the importance of the pretreatment physics plan review. This quality assurance measure is fundamentally important and central to the safety of patients and the quality of care that they receive. However, little is known about its effectiveness. The purpose of this study was to analyze reported incidents to quantify the effectiveness of the pretreatment physics plan review with the goal of improving it. Methods: This study analyzed 522 potentially severe or critical near-miss events within an institutional incident learning system collected over a three-year period. Of these 522 events, 356 originated at a workflow point that was prior to the pretreatment physics plan review. The remaining 166 events originated after the pretreatment physics plan review and were not considered in the study. The applicable 356 events were classified into one of the three categories: (1) events detected by the pretreatment physics plan review, (2) events not detected but “potentially detectable” by the physics review, and (3) events “not detectable” by the physics review. Potentially detectable events were further classified by which specific checks performed during the pretreatment physics plan review detected or could have detected the event. For these events, the associated specific check was also evaluated as to the possibility of automating that check given current data structures. For comparison, a similar analysis was carried out on 81 events from the international SAFRON radiation oncology incident learning system. Results: Of the 356 applicable events from the institutional database, 180/356 (51%) were detected or could have been detected by the pretreatment physics plan review. 
Of these events, 125 actually passed through the physics review; however, only 38% (47/125) were actually detected at the review. Of the 81 events from the SAFRON database, 66/81 (81%) were potentially detectable by the pretreatment physics plan review. From the institutional database, three specific physics checks were particularly effective at detecting events (combined effectiveness of 38%): verifying the isocenter (39/180), verifying DRRs (17/180), and verifying that the plan matched the prescription (12/180). The most effective checks from the SAFRON database were verifying that the plan matched the prescription (13/66) and verifying the field parameters in the record and verify system against those in the plan (23/66). Software-based plan checking systems, if available, would have potential effectiveness of 29% and 64% at detecting events from the institutional and SAFRON databases, respectively. Conclusions: Pretreatment physics plan review is a key safety measure and can detect a high percentage of errors. However, the majority of errors that potentially could have been detected were not detected in this study, indicating the need to improve the pretreatment physics plan review performance. Suggestions for improvement include the automation of specific physics checks performed during the pretreatment physics plan review and the standardization of the review process.
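The percentages quoted in the Results follow directly from the reported counts. A short check, using only the numbers given in the abstract (the helper name `pct` is introduced here for illustration):

```python
def pct(numerator, denominator):
    """Percentage rounded to the nearest whole number."""
    return round(100 * numerator / denominator)

# Counts as reported in the abstract.
potentially_detectable = pct(180, 356)    # detected or detectable by the review
actually_detected = pct(47, 125)          # detected among events that passed through review
safron_detectable = pct(66, 81)           # SAFRON events potentially detectable
top_three_checks = pct(39 + 17 + 12, 180) # isocenter + DRRs + plan/prescription match

print(potentially_detectable, actually_detected, safron_detectable, top_three_checks)
```

The "combined effectiveness of 38%" is therefore the sum of the three named checks (68 events) over the 180 potentially detectable events, and it is a different quantity from the 38% of events actually caught at review (47/125).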
NASA Astrophysics Data System (ADS)
Reynen, Andrew; Audet, Pascal
2017-09-01
A new method using a machine learning technique is applied to event classification and detection at seismic networks. This method is applicable to a variety of network sizes and settings. The algorithm makes use of a small catalogue of known observations across the entire network. Two attributes, the polarization and frequency content, are used as input to regression. These attributes are extracted at predicted arrival times for P and S waves using only an approximate velocity model, as attributes are calculated over large time spans. This method of waveform characterization is shown to be able to distinguish between blasts and earthquakes with 99 per cent accuracy using a network of 13 stations located in Southern California. The combination of machine learning with generalized waveform features is further applied to event detection in Oklahoma, United States. The event detection algorithm makes use of a pair of unique seismic phases to locate events, with a precision directly related to the sampling rate of the generalized waveform features. Over a week of data from 30 stations in Oklahoma, United States, is used to automatically detect 25 times more events than the catalogue of the local geological survey, with a false detection rate of less than 2 per cent. This method provides a highly confident way of detecting and locating events. Furthermore, a large number of seismic events can be automatically detected with a low false alarm rate, allowing for a larger automatic event catalogue with a high degree of trust.
Video Traffic Analysis for Abnormal Event Detection
DOT National Transportation Integrated Search
2010-01-01
We propose the use of video imaging sensors for the detection and classification of abnormal events to be used primarily for mitigation of traffic congestion. Successful detection of such events will allow for new road guidelines; for rapid deploymen...
Perception of conversations: the importance of semantics and intonation in children's development.
Keitel, Anne; Prinz, Wolfgang; Friederici, Angela D; von Hofsten, Claes; Daum, Moritz M
2013-10-01
In conversations, adults readily detect and anticipate the end of a speaker's turn. However, little is known about the development of this ability. We addressed two important aspects involved in the perception of conversational turn taking: semantic content and intonational form. The influence of semantics was investigated by testing prelinguistic and linguistic children. The influence of intonation was tested by presenting participants with videos of two dyadic conversations: one with normal intonation and one with flattened (removed) intonation. Children of four different age groups--two prelinguistic groups (6- and 12-month-olds) and two linguistic groups (24- and 36-month-olds)--and an adult group participated. Their eye movements were recorded, and the frequency of anticipated turns was analyzed. Our results show that (a) the anticipation of turns was reliable only in 3-year-olds and adults, with younger children shifting their gaze between speakers regardless of the turn taking, and (b) only 3-year-olds anticipated turns better if intonation was normal. These results indicate that children anticipate turns in conversations in a manner comparable (but not identical) to adults only after they have developed a sophisticated understanding of language. In contrast to adults, 3-year-olds rely more strongly on prosodic information during the perception of conversational turn taking. Copyright © 2013 Elsevier Inc. All rights reserved.
A universal cue for grammatical categories in the input to children: Frequent frames.
Moran, Steven; Blasi, Damián E; Schikowski, Robert; Küntay, Aylin C; Pfeiler, Barbara; Allen, Shanley; Stoll, Sabine
2018-06-01
How does a child map words to grammatical categories when words are not overtly marked either lexically or prosodically? Recent language acquisition theories have proposed that distributional information encoded in sequences of words or morphemes might play a central role in forming grammatical classes. To test this proposal, we analyze child-directed speech from seven typologically diverse languages to simulate maximum variation in the structures of the world's languages. We ask whether the input to children contains cues for assigning syntactic categories in frequent frames, which are frequently occurring nonadjacent sequences of words or morphemes. In accord with aggregated results from previous studies on individual languages, we find that frequent word frames do not provide a robust distributional pattern for accurately predicting grammatical categories. However, our results show that frames are extremely accurate cues cross-linguistically at the morpheme level. We theorize that the nonadjacent dependency pattern captured by frequent frames is a universal anchor point for learners on the morphological level to detect and categorize grammatical categories. Whether frames also play a role on higher linguistic levels such as words is determined by grammatical features of the individual language. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Modeling Concept Dependencies for Event Detection
2014-04-04
Gaussian Mixture Model (GMM). Jiang et al. [8] provide a summary of experiments for TRECVID MED 2010. They employ low-level features such as SIFT and...event detection literature. Ballan et al. [2] present a method to introduce temporal information for video event detection with a BoW (bag-of-words...approach. Zhou et al. [24] study video event detection by encoding a video with a set of bag of SIFT feature vectors and describe the distribution with a
Zhu, Pengyu; Fu, Wei; Wang, Chenguang; Du, Zhixin; Huang, Kunlun; Zhu, Shuifang; Xu, Wentao
2016-04-15
The possibility of the absolute quantitation of GMO events by digital PCR was recently reported. However, most absolute quantitation methods based on digital PCR require pretreatment steps. Meanwhile, singleplex detection cannot meet the demand of the absolute quantitation of GMO events, which is based on the ratio of foreign fragments to reference genes. Thus, to promote the absolute quantitative detection of different GMO events by digital PCR, we developed a quantitative detection method based on duplex digital PCR without pretreatment. Moreover, we tested 7 GMO events in our study to evaluate the fitness of our method. The optimized combination of foreign and reference primers, limit of quantitation (LOQ), limit of detection (LOD) and specificity were validated. The results showed that the LOQ of our method for different GMO events was 0.5%, while the LOD was 0.1%. Additionally, we found that duplex digital PCR could achieve detection results with a lower RSD than singleplex digital PCR. In summary, the duplex digital PCR detection system is a simple and stable way to achieve the absolute quantitation of different GMO events. Moreover, the LOQ and LOD indicated that this method is suitable for the daily detection and quantitation of GMO events. Copyright © 2016 Elsevier B.V. All rights reserved.
Detecting Earthquakes over a Seismic Network using Single-Station Similarity Measures
NASA Astrophysics Data System (ADS)
Bergen, Karianne J.; Beroza, Gregory C.
2018-03-01
New blind waveform-similarity-based detection methods, such as Fingerprint and Similarity Thresholding (FAST), have shown promise for detecting weak signals in long-duration, continuous waveform data. While blind detectors are capable of identifying similar or repeating waveforms without templates, they can also be susceptible to false detections due to local correlated noise. In this work, we present a set of three new methods that allow us to extend single-station similarity-based detection over a seismic network: event-pair extraction, pairwise pseudo-association, and event resolution. Together they complete a post-processing pipeline that combines single-station similarity measures (e.g. FAST sparse similarity matrix) from each station in a network into a list of candidate events. The core technique, pairwise pseudo-association, leverages the pairwise structure of event detections in its network detection model, which allows it to identify events observed at multiple stations in the network without modeling the expected move-out. Though our approach is general, we apply it to extend FAST over a sparse seismic network. We demonstrate that our network-based extension of FAST is both sensitive and maintains a low false detection rate. As a test case, we apply our approach to two weeks of continuous waveform data from five stations during the foreshock sequence prior to the 2014 Mw 8.2 Iquique earthquake. Our method identifies nearly five times as many events as the local seismicity catalog (including 95% of the catalog events), and less than 1% of these candidate events are false detections.
Intrinsic fundamental frequency of vowels is moderated by regional dialect
Jacewicz, Ewa; Fox, Robert Allen
2015-01-01
There has been a long-standing debate whether the intrinsic fundamental frequency (IF0) of vowels is an automatic consequence of articulation or whether it is independently controlled by speakers to perceptually enhance vowel contrasts along the height dimension. This paper provides evidence from regional variation in American English that IF0 difference between high and low vowels is, in part, controlled and varies across dialects. The sources of this F0 control are socio-cultural and cannot be attributed to differences in the vowel inventory size. The socially motivated enhancement was found only in prosodically prominent contexts. PMID:26520352
Generalized Detectability for Discrete Event Systems
Shu, Shaolong; Lin, Feng
2011-01-01
In our previous work, we investigated detectability of discrete event systems, which is defined as the ability to determine the current and subsequent states of a system based on observation. For different applications, we defined four types of detectabilities: (weak) detectability, strong detectability, (weak) periodic detectability, and strong periodic detectability. In this paper, we extend our results in three aspects. (1) We extend detectability from deterministic systems to nondeterministic systems. Such a generalization is necessary because there are many systems that need to be modeled as nondeterministic discrete event systems. (2) We develop polynomial algorithms to check strong detectability. The previous algorithms are based on observer whose construction is of exponential complexity, while the new algorithms are based on a new automaton called detector. (3) We extend detectability to D-detectability. While detectability requires determining the exact state of a system, D-detectability relaxes this requirement by asking only to distinguish certain pairs of states. With these extensions, the theory on detectability of discrete event systems becomes more applicable in solving many practical problems. PMID:21691432
TERMA Framework for Biomedical Signal Analysis: An Economic-Inspired Approach
Elgendi, Mohamed
2016-01-01
Biomedical signals contain features that represent physiological events, and each of these events has peaks. The analysis of biomedical signals for monitoring or diagnosing diseases requires the detection of these peaks, making event detection a crucial step in biomedical signal processing. Many researchers have difficulty detecting these peaks to investigate, interpret and analyze their corresponding events. To date, there is no generic framework that captures these events in a robust, efficient and consistent manner. A new method referred to for the first time as two event-related moving averages (“TERMA”) involves event-related moving averages and detects events in biomedical signals. The TERMA framework is flexible and universal and consists of six independent LEGO building bricks to achieve high accuracy detection of biomedical events. Results recommend that the window sizes for the two moving averages (W1 and W2) have to follow the inequality (8×W1)≥W2≥(2×W1). Moreover, TERMA is a simple yet efficient event detector that is suitable for wearable devices, point-of-care devices, fitness trackers and smart watches, compared to more complex machine learning solutions. PMID:27827852
Detecting and Locating Seismic Events Without Phase Picks or Velocity Models
NASA Astrophysics Data System (ADS)
Arrowsmith, S.; Young, C. J.; Ballard, S.; Slinkard, M.
2015-12-01
The standard paradigm for seismic event monitoring is to scan waveforms from a network of stations and identify the arrival time of various seismic phases. A signal association algorithm then groups the picks to form events, which are subsequently located by minimizing residuals between measured travel times and travel times predicted by an Earth model. Many of these steps are prone to significant errors which can lead to erroneous arrival associations and event locations. Here, we revisit a concept for event detection that does not require phase picks or travel time curves and fuses detection, association and location into a single algorithm. Our pickless event detector exploits existing catalog and waveform data to build an empirical stack of the full regional seismic wavefield, which is subsequently used to detect and locate events at a network level using correlation techniques. Because the technique uses more of the information content of the original waveforms, the concept is particularly powerful for detecting weak events that would be missed by conventional methods. We apply our detector to seismic data from the University of Utah Seismograph Stations network and compare our results with the earthquake catalog published by the University of Utah. We demonstrate that the pickless detector can detect and locate significant numbers of events previously missed by standard data processing techniques.
Zhou, Hui; Ji, Ning; Samuel, Oluwarotimi Williams; Cao, Yafei; Zhao, Zheyi; Chen, Shixiong; Li, Guanglin
2016-10-01
Real-time detection of gait events can be applied as a reliable input to control drop foot correction devices and lower-limb prostheses. Among the different sensors used to acquire the signals associated with walking for gait event detection, the accelerometer is considered as a preferable sensor due to its convenience of use, small size, low cost, reliability, and low power consumption. Based on the acceleration signals, different algorithms have been proposed to detect toe off (TO) and heel strike (HS) gait events in previous studies. While these algorithms could achieve a relatively reasonable performance in gait event detection, they suffer from limitations such as poor real-time performance and are less reliable in the cases of up stair and down stair terrains. In this study, a new algorithm is proposed to detect the gait events on three walking terrains in real-time based on the analysis of acceleration jerk signals with a time-frequency method to obtain gait parameters, and then the determination of the peaks of jerk signals using peak heuristics. The performance of the newly proposed algorithm was evaluated with eight healthy subjects when they were walking on level ground, up stairs, and down stairs. Our experimental results showed that the mean F1 scores of the proposed algorithm were above 0.98 for HS event detection and 0.95 for TO event detection on the three terrains. This indicates that the current algorithm would be robust and accurate for gait event detection on different terrains. Findings from the current study suggest that the proposed method may be a preferable option in some applications such as drop foot correction devices and leg prostheses.
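The core signal operations named above, computing jerk from acceleration and picking its peaks, can be sketched as follows. This is a simplified illustration that omits the time-frequency analysis step; the threshold, the minimum gap between peaks, and the function names are assumptions rather than the study's peak heuristics.

```python
def jerk(acc, fs):
    """Jerk as the first difference of acceleration, scaled by sampling rate fs (Hz)."""
    return [(b - a) * fs for a, b in zip(acc, acc[1:])]

def find_peaks(x, threshold, min_gap):
    """Indices of local maxima above threshold, keeping at most one per min_gap samples."""
    peaks = []
    for i in range(1, len(x) - 1):
        if x[i] > threshold and x[i] >= x[i - 1] and x[i] > x[i + 1]:
            if peaks and i - peaks[-1] < min_gap:
                # Too close to the previous peak: keep whichever is larger.
                if x[i] > x[peaks[-1]]:
                    peaks[-1] = i
            else:
                peaks.append(i)
    return peaks

# Synthetic acceleration trace with two sharp rises (stand-ins for gait events).
acc = [0.0] * 40
for start in (10, 30):
    acc[start], acc[start + 1], acc[start + 2] = 1.0, 4.0, 5.0

peaks = find_peaks(jerk(acc, fs=100), threshold=200.0, min_gap=10)
print(peaks)
```

Because each sample only needs its immediate neighbours, a detector of this shape can run on streaming data with a one-sample delay, which is one reason jerk-based peak picking suits real-time use.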
Zhou, Hui; Ji, Ning; Samuel, Oluwarotimi Williams; Cao, Yafei; Zhao, Zheyi; Chen, Shixiong; Li, Guanglin
2016-01-01
Real-time detection of gait events can be applied as a reliable input to control drop foot correction devices and lower-limb prostheses. Among the different sensors used to acquire the signals associated with walking for gait event detection, the accelerometer is considered as a preferable sensor due to its convenience of use, small size, low cost, reliability, and low power consumption. Based on the acceleration signals, different algorithms have been proposed to detect toe off (TO) and heel strike (HS) gait events in previous studies. While these algorithms could achieve a relatively reasonable performance in gait event detection, they suffer from limitations such as poor real-time performance and are less reliable in the cases of up stair and down stair terrains. In this study, a new algorithm is proposed to detect the gait events on three walking terrains in real-time based on the analysis of acceleration jerk signals with a time-frequency method to obtain gait parameters, and then the determination of the peaks of jerk signals using peak heuristics. The performance of the newly proposed algorithm was evaluated with eight healthy subjects when they were walking on level ground, up stairs, and down stairs. Our experimental results showed that the mean F1 scores of the proposed algorithm were above 0.98 for HS event detection and 0.95 for TO event detection on the three terrains. This indicates that the current algorithm would be robust and accurate for gait event detection on different terrains. Findings from the current study suggest that the proposed method may be a preferable option in some applications such as drop foot correction devices and leg prostheses. PMID:27706086
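The jerk-and-peak idea in the abstract above can be illustrated with a minimal sketch. This is not the authors' algorithm: the time-frequency step that derives gait parameters is omitted, and the function names, the threshold `min_height`, and the spacing `min_gap` are assumptions chosen only for illustration.

```python
def jerk(acc, dt):
    """Finite-difference jerk: rate of change of acceleration between samples."""
    return [(acc[i + 1] - acc[i]) / dt for i in range(len(acc) - 1)]


def find_peaks(signal, min_height, min_gap):
    """Indices of local maxima >= min_height, at least min_gap samples apart.

    Hypothetical peak heuristic: when two candidates fall closer than
    min_gap, only the larger one is kept.
    """
    peaks = []
    for i in range(1, len(signal) - 1):
        is_local_max = signal[i - 1] < signal[i] >= signal[i + 1]
        if not is_local_max or signal[i] < min_height:
            continue
        if peaks and i - peaks[-1] < min_gap:
            if signal[i] > signal[peaks[-1]]:
                peaks[-1] = i
            continue
        peaks.append(i)
    return peaks


# Synthetic acceleration trace with two sharp impacts (illustrative values only).
acc = [0, 0, 0, 5, 0, 0, 0, 0, 0, 0, 6, 0, 0, 0]
dt = 0.01  # assumed sampling interval in seconds
peaks = find_peaks(jerk(acc, dt), min_height=100, min_gap=3)
```

In a real-time setting the same heuristic would run on a sliding buffer of recent samples; the thresholds would have to be tuned per subject and terrain.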
Systematic detection of seismic events at Mount St. Helens with an ultra-dense array
NASA Astrophysics Data System (ADS)
Meng, X.; Hartog, J. R.; Schmandt, B.; Hotovec-Ellis, A. J.; Hansen, S. M.; Vidale, J. E.; Vanderplas, J.
2016-12-01
During the summer of 2014, an ultra-dense array of 900 geophones was deployed around the crater of Mount St. Helens and continuously operated for 15 days. This dataset provides an unprecedented opportunity to systematically detect seismic events around an active volcano and study their underlying mechanisms. We use a waveform-based matched filter technique to detect seismic events from this dataset. Due to the large volume of continuous data (~1 TB), we performed the detection on the GPU cluster Stampede (https://www.tacc.utexas.edu/systems/stampede). We build a suite of template events from three catalogs: 1) the standard Pacific Northwest Seismic Network (PNSN) catalog (45 events); 2) the catalog from Hansen & Schmandt (2015) obtained with a reverse-time imaging method (212 events); and 3) the catalog identified with a matched filter technique using the PNSN permanent stations (190 events). By searching for template matches in the ultra-dense array, we find 2237 events. We then calibrate precise relative magnitudes for template and detected events, using a principal component fit to measure waveform amplitude ratios. The magnitude of completeness and b-value of the detected catalog are -0.5 and 1.1, respectively. Our detected catalog shows several intensive swarms, which are likely driven by fluid pressure transients in conduits or slip transients on faults underneath the volcano. We are currently relocating the detected catalog with HypoDD and measuring the seismic velocity changes at Mount St. Helens using coda wave interferometry of detected repeating earthquakes. The accurate temporal-spatial migration pattern of seismicity and seismic property changes should shed light on the physical processes beneath Mount St. Helens.
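A waveform-based matched filter of the kind mentioned above slides a template along continuous data and flags windows whose normalized cross-correlation exceeds a threshold. The following is a minimal single-channel sketch in plain Python, not the GPU implementation used in the study; the function names, the synthetic data, and the 0.95 threshold are assumptions for illustration.

```python
import math


def normalized_cc(template, window):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(template)
    mt = sum(template) / n
    mw = sum(window) / n
    num = sum((t - mt) * (w - mw) for t, w in zip(template, window))
    den = math.sqrt(sum((t - mt) ** 2 for t in template) *
                    sum((w - mw) ** 2 for w in window))
    # A flat window has zero variance; treat it as uncorrelated.
    return num / den if den > 0 else 0.0


def matched_filter(template, trace, threshold):
    """Return (start_index, cc) for every window with cc >= threshold."""
    n = len(template)
    detections = []
    for start in range(len(trace) - n + 1):
        cc = normalized_cc(template, trace[start:start + n])
        if cc >= threshold:
            detections.append((start, cc))
    return detections


# Synthetic example: the template appears twice, scaled by 2 and by 0.5.
template = [0, 1, -1, 0.5]
trace = [0, 0, 0, 0, 2, -2, 1, 0, 0, 0, 0, 0.5, -0.5, 0.25, 0]
detections = matched_filter(template, trace, threshold=0.95)
```

Because the correlation is normalized, both scaled copies are detected regardless of amplitude, which is why the technique suits weak repeating events. Network applications typically stack the correlation traces across stations before thresholding, a step omitted here.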
Automatic Detection and Classification of Audio Events for Road Surveillance Applications.
Almaadeed, Noor; Asim, Muhammad; Al-Maadeed, Somaya; Bouridane, Ahmed; Beghdadi, Azeddine
2018-06-06
This work investigates the problem of detecting hazardous events on roads by designing an audio surveillance system that automatically detects perilous situations such as car crashes and tire skidding. In recent years, several visual surveillance systems have been proposed for road monitoring to detect accidents, with the aim of improving safety procedures in emergency cases. However, visual information alone cannot detect certain events such as car crashes and tire skidding, especially under adverse and visually cluttered weather conditions such as snowfall, rain, and fog. Consequently, the incorporation of microphones and audio event detectors based on audio processing can significantly enhance the detection accuracy of such surveillance systems. This paper proposes to combine time-domain, frequency-domain, and joint time-frequency features extracted from a class of quadratic time-frequency distributions (QTFDs) to detect events on roads through audio analysis and processing. Experiments were carried out using a publicly available dataset. The experimental results confirm the effectiveness of the proposed approach for detecting hazardous events on roads, as demonstrated by a 7% improvement in accuracy rate when compared against methods that use individual temporal and spectral features.
Detecting earthquakes over a seismic network using single-station similarity measures
NASA Astrophysics Data System (ADS)
Bergen, Karianne J.; Beroza, Gregory C.
2018-06-01
New blind waveform-similarity-based detection methods, such as Fingerprint and Similarity Thresholding (FAST), have shown promise for detecting weak signals in long-duration, continuous waveform data. While blind detectors are capable of identifying similar or repeating waveforms without templates, they can also be susceptible to false detections due to local correlated noise. In this work, we present a set of three new methods that allow us to extend single-station similarity-based detection over a seismic network; event-pair extraction, pairwise pseudo-association, and event resolution complete a post-processing pipeline that combines single-station similarity measures (e.g. FAST sparse similarity matrix) from each station in a network into a list of candidate events. The core technique, pairwise pseudo-association, leverages the pairwise structure of event detections in its network detection model, which allows it to identify events observed at multiple stations in the network without modeling the expected moveout. Though our approach is general, we apply it to extend FAST over a sparse seismic network. We demonstrate that our network-based extension of FAST is both sensitive and maintains a low false detection rate. As a test case, we apply our approach to 2 weeks of continuous waveform data from five stations during the foreshock sequence prior to the 2014 Mw 8.2 Iquique earthquake. Our method identifies nearly five times as many events as the local seismicity catalogue (including 95 per cent of the catalogue events), and less than 1 per cent of these candidate events are false detections.
Ohyama, Junji; Watanabe, Katsumi
2016-01-01
We examined how the temporal and spatial predictability of a task-irrelevant visual event affects the detection and memory of a visual item embedded in a continuously changing sequence. Participants observed 11 sequentially presented letters, during which a task-irrelevant visual event was either present or absent. Predictabilities of spatial location and temporal position of the event were controlled in 2 × 2 conditions. In the spatially predictable conditions, the event occurred at the same location within the stimulus sequence or at another location, while in the spatially unpredictable conditions, it occurred at random locations. In the temporally predictable conditions, the event timing was fixed relative to the order of the letters, while in the temporally unpredictable conditions, it could not be predicted from the letter order. Participants performed a working memory task and a target detection reaction time (RT) task. Memory accuracy was higher for a letter simultaneously presented at the same location as the event in the temporally unpredictable conditions, irrespective of the spatial predictability of the event. On the other hand, the detection RTs were only faster for a letter simultaneously presented at the same location as the event when the event was both temporally and spatially predictable. Thus, to facilitate ongoing detection processes, an event must be predictable both in space and time, while memory processes are enhanced by temporally unpredictable (i.e., surprising) events. Evidently, temporal predictability has differential effects on detection and memory of a visual item embedded in a sequence of images. PMID:26869966
Arrowsmith, Stephen John; Young, Christopher J.; Ballard, Sanford; ...
2016-01-01
The standard paradigm for seismic event monitoring breaks the event detection problem down into a series of processing stages that can be categorized at the highest level into station-level processing and network-level processing algorithms (e.g., Le Bras and Wuster (2002)). At the station-level, waveforms are typically processed to detect signals and identify phases, which may subsequently be updated based on network processing. At the network-level, phase picks are associated to form events, which are subsequently located. Furthermore, waveforms are typically directly exploited only at the station-level, while network-level operations rely on earth models to associate and locate the events that generated the phase picks.
Event detection in an assisted living environment.
Stroiescu, Florin; Daly, Kieran; Kuris, Benjamin
2011-01-01
This paper presents the design of a wireless event detection and in-building location awareness system. The system's architecture is based on using a body-worn sensor to detect events such as falls where they occur in an assisted living environment. This process involves developing event detection algorithms and transmitting such events wirelessly to an in-house network based on the 802.15.4 protocol. The network would then generate alerts both in the assisted living facility and remotely to an offsite monitoring facility. The focus of this paper is on the design of the system architecture and the compliance challenges in applying this technology.
Adaptively Adjusted Event-Triggering Mechanism on Fault Detection for Networked Control Systems.
Wang, Yu-Long; Lim, Cheng-Chew; Shi, Peng
2016-12-08
This paper studies the problem of fault detection based on an adaptively adjusted event-triggering mechanism for a class of discrete-time networked control systems (NCSs) with applications to aircraft dynamics. By taking into account the fault occurrence detection progress and the fault occurrence probability, and introducing an adaptively adjusted event-triggering parameter, a novel event-triggering mechanism is proposed to achieve efficient utilization of the communication network bandwidth. Both the sensor-to-control station and the control station-to-actuator network-induced delays are taken into account. The event-triggered sensor and the event-triggered control station are utilized simultaneously to establish new network-based closed-loop models for the NCS subject to faults. Based on the established models, the event-triggered simultaneous design of fault detection filter (FDF) and controller is presented. A new algorithm for handling the adaptively adjusted event-triggering parameter is proposed. Performance analysis verifies the effectiveness of the adaptively adjusted event-triggering mechanism and of the simultaneous design of FDF and controller.
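To make the bandwidth-saving idea concrete, the sketch below shows a generic relative-threshold event trigger for a scalar signal: a sample is transmitted only when it deviates enough from the last transmitted value. This is a common textbook form, not the mechanism proposed in the paper; in particular the triggering parameter `sigma` is held fixed here, whereas the paper adjusts it adaptively by a rule the abstract does not specify. All names and values are assumptions.

```python
def event_triggered_samples(samples, sigma):
    """Return indices of the samples that would be transmitted.

    Assumed trigger rule: send sample x_k when
        (x_k - x_last)^2 > sigma * x_k^2,
    where x_last is the most recently transmitted value.
    """
    sent = [0]          # the first sample is always transmitted
    last = samples[0]
    for k in range(1, len(samples)):
        x = samples[k]
        if (x - last) ** 2 > sigma * x ** 2:
            sent.append(k)
            last = x
    return sent


# Illustrative sensor readings.
samples = [1.0, 1.05, 1.1, 1.5, 1.52, 1.0]
sparse = event_triggered_samples(samples, sigma=0.04)  # tolerant trigger
dense = event_triggered_samples(samples, sigma=0.0)    # sends on any change
```

A larger `sigma` saves more bandwidth at the cost of staler data at the receiver. An adaptive scheme would, for example, shrink `sigma` when a fault is suspected so that more samples reach the fault detection filter; that adaptation is not shown here.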
Perceiving goals and actions in individuals with autism spectrum disorders.
Zalla, Tiziana; Labruyère, Nelly; Georgieff, Nicolas
2013-10-01
In the present study, we investigated the ability to parse familiar sequences of action into meaningful events in young individuals with autism spectrum disorders (ASDs), as compared to young individuals with typical development (TD) and young individuals with moderate mental retardation or learning disabilities (MLDs). While viewing two videotaped movies, participants were requested to detect the boundary transitions between component events at both fine and coarse levels of the hierarchical structure of action. Overall, reduced accuracy for event detection was found in participants with ASDs, relative to participants with TD, at both levels of action segmentation. The performance was, however, equally diminished in participants with ASDs and MLDs under the coarse-grained segmentation, suggesting that difficulties in detecting fine-grained events in ASDs cannot be explained by a general intellectual dysfunction. Reduced accuracy for event detection was related to diminished event recall, memory for event sequence and Theory of Mind abilities. We hypothesized that difficulties with event detection result from a deficit disrupting the on-line processing of kinematic features and physical changes of dynamic human actions. An impairment at the earlier stages of the event encoding process might contribute to deficits in episodic memory and social functioning in individuals with ASDs.
Station Set Residual: Event Classification Using Historical Distribution of Observing Stations
NASA Astrophysics Data System (ADS)
Procopio, Mike; Lewis, Jennifer; Young, Chris
2010-05-01
Analysts working at the International Data Centre in support of treaty monitoring through the Comprehensive Nuclear-Test-Ban Treaty Organization spend a significant amount of time reviewing hypothesized seismic events produced by an automatic processing system. When reviewing these events to determine their legitimacy, analysts take a variety of approaches that rely heavily on training and past experience. One method used by analysts to gauge the validity of an event involves examining the set of stations involved in the detection of an event. In particular, leveraging past experience, an analyst can say that an event located in a certain part of the world is expected to be detected by Stations A, B, and C. Implicit in this statement is that such an event would usually not be detected by Stations X, Y, or Z. For some well understood parts of the world, the absence of one or more "expected" stations—or the presence of one or more "unexpected" stations—is correlated with a hypothesized event's legitimacy and to its survival to the event bulletin. The primary objective of this research is to formalize and quantify the difference between the observed set of stations detecting some hypothesized event, versus the expected set of stations historically associated with detecting similar nearby events close in magnitude. This Station Set Residual can be quantified in many ways, some of which are correlated with the analysts' determination of whether or not the event is valid. We propose that this Station Set Residual score can be used to screen out certain classes of "false" events produced by automatic processing with a high degree of confidence, reducing the analyst burden. Moreover, we propose that the visualization of the historically expected distribution of detecting stations can be immediately useful as an analyst aid during their review process.
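The abstract notes that the Station Set Residual "can be quantified in many ways" without naming one. As one possible illustration only, the sketch below uses the Jaccard distance between the observed and expected station sets and also lists which stations are missing or unexpected. The choice of Jaccard distance, the function names, and the station labels are all assumptions, not the authors' metric.

```python
def station_set_residual(observed, expected):
    """Jaccard distance between station sets: 0.0 = identical, 1.0 = disjoint."""
    observed, expected = set(observed), set(expected)
    union = observed | expected
    if not union:
        return 0.0
    return 1.0 - len(observed & expected) / len(union)


def missing_and_unexpected(observed, expected):
    """Stations expected but absent, and stations present but not expected."""
    missing = sorted(set(expected) - set(observed))
    unexpected = sorted(set(observed) - set(expected))
    return missing, unexpected


# Hypothetical event: historically seen at A, B, C; now reported at A, B, X.
expected = {"A", "B", "C"}
observed = {"A", "B", "X"}
residual = station_set_residual(observed, expected)
missing, unexpected = missing_and_unexpected(observed, expected)
```

In practice the expected set would be estimated from historical events near the hypothesized location and magnitude, and a weighted score (for example, by each station's historical detection probability) may discriminate better than an unweighted set distance.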
Radiation detector device for rejecting and excluding incomplete charge collection events
Bolotnikov, Aleksey E.; De Geronimo, Gianluigi; Vernon, Emerson; Yang, Ge; Camarda, Giuseppe; Cui, Yonggang; Hossain, Anwar; Kim, Ki Hyun; James, Ralph B.
2016-05-10
A radiation detector device is provided that is capable of distinguishing between full charge collection (FCC) events and incomplete charge collection (ICC) events based upon a correlation value comparison algorithm that compares correlation values calculated for individually sensed radiation detection events with a calibrated FCC event correlation function. The calibrated FCC event correlation function serves as a reference curve utilized by a correlation value comparison algorithm to determine whether a sensed radiation detection event fits the profile of the FCC event correlation function within the noise tolerances of the radiation detector device. If the radiation detection event is determined to be an ICC event, then the spectrum for the ICC event is rejected and excluded from inclusion in the radiation detector device spectral analyses. The radiation detector device also can calculate a performance factor to determine the efficacy of distinguishing between FCC and ICC events.
Abnormal global and local event detection in compressive sensing domain
NASA Astrophysics Data System (ADS)
Wang, Tian; Qiao, Meina; Chen, Jie; Wang, Chuanyun; Zhang, Wenjia; Snoussi, Hichem
2018-05-01
Abnormal event detection, also known as anomaly detection, is a challenging task in security video surveillance. It is important to develop effective and robust movement representation models for global and local abnormal event detection that cope with factors such as occlusion and illumination change. In this paper, a new algorithm is proposed that can both locate abnormal events within a frame and detect globally abnormal frames. The proposed algorithm employs a sparse measurement matrix designed to efficiently represent movement features based on optical flow. The abnormal event detection problem is then formulated as a one-class classification task that learns only from normal training samples. Experiments demonstrate that our algorithm performs well on benchmark abnormal detection datasets against state-of-the-art methods.