scholarly journals Perception in Black and White: Effects of Intonational Variables and Filtering Conditions on Sociolinguistic Judgments With Implications for ASR

2021 ◽  
Vol 4 ◽  
Author(s):  
Nicole R. Holliday

This study tests the effects of intonational contours and filtering conditions on listener judgments of ethnicity to arrive at a more comprehensive understanding on how prosody influences these judgments, with implications for austomatic speech recognition systems as well as speech synthesis. In a perceptual experiment, 40 American English listeners heard phrase-long clips which were controlled for pitch accent type and focus marking. Each clip contained either two H* (high) or two L+H* (low high) pitch accents and a L-L% (falling) boundary tone, and had also previously been labelled for broad or narrow focus. Listeners rated clips in two tasks, one with unmodified stimuli and one with stimuli lowpass filtered at 400 Hz, and were asked to judge whether the speaker was “Black” or “White”. In the filtered condition, tokens with the L+H* pitch accent were more likely to be rated as “Black”, with an interaction such that broad focus enhanced this pattern, supporting earlier findings that listeners may perceive African American Language as having more variation in possible pitch accent meanings. In the unfiltered condition, tokens with the L+H* pitch accent were less likely to be rated as Black, with no effect of focus, likely due to the fact that listeners relied more heavily on available segmental information in this condition. These results enhance our understanding of cues listeners rely on in making social judgments about speakers, especially in ethnic identification and linguistic profiling, by highlighting perceptual differences due to listening environment as well as predicted meaning of specific intonational contours. They also contribute to our understanding of the role of how human listeners interpret meaning within a holistic context, which has implications for the construction of computational systems designed to replicate the properties of natural language. In particular, they have important applicability to speech synthesis and speech recognition programs, which are often limited in their capacities due to the fact that they do not make such holistic sociolinguistic considerations of the meanings of input or output speech.

2011 ◽  
Vol 23 (9) ◽  
pp. 2447-2467 ◽  
Author(s):  
Sara Bögels ◽  
Herbert Schriefers ◽  
Wietske Vonk ◽  
Dorothee J. Chwilla

The present study addresses the question whether accentuation and prosodic phrasing can have a similar function, namely, to group words in a sentence together. Participants listened to locally ambiguous sentences containing object- and subject-control verbs while ERPs were measured. In Experiment 1, these sentences contained a prosodic break, which can create a certain syntactic grouping of words, or no prosodic break. At the disambiguation, an N400 effect occurred when the disambiguation was in conflict with the syntactic grouping created by the break. We found a similar N400 effect without the break, indicating that the break did not strengthen an already existing preference. This pattern held for both object- and subject-control items. In Experiment 2, the same sentences contained a break and a pitch accent on the noun following the break. We argue that the pitch accent indicates a broad focus covering two words [see Gussenhoven, C. On the limits of focus projection in English. In P. Bosch & R. van der Sandt (Eds.), Focus: Linguistic, cognitive, and computational perspectives. Cambridge: University Press, 1999], thus grouping these words together. For object-control items, this was semantically possible, which led to a “good-enough” interpretation of the sentence. Therefore, both sentences were interpreted equally well and the N400 effect found in Experiment 1 was absent. In contrast, for subject-control items, a corresponding grouping of the words was impossible, both semantically and syntactically, leading to processing difficulty in the form of an N400 effect and a late positivity. In conclusion, accentuation can group words together on the level of information structure, leading to either a semantically “good-enough” interpretation or a processing problem when such a semantic interpretation is not possible.


Author(s):  
Robin Karlin

Featural and gestural models of tone differ on the degree to which they include timing information in the representation. However, both assume some kind of simultaneity between tones and their tone-bearing units, where featural models emphasize the role of acoustic relationships and gestural models instead emphasize articulatory coordination. We present the results of two acoustic production studies on two dialects of Serbian, a lexical pitch accent language. In the Belgrade dialect, pitch accents are aligned relatively late in the tone-bearing unit, while in the Valjevo dialect, pitch accents are phonetically retracted, sometimes into the preceding syllable. We varied the syllable onsets of tone-bearing units in falling (experiment 1) and rising (experiment 2) pitch accents, and measured the effects on F0 contours. Despite these differences in phonetic alignment, the phonological system is the same in both dialects. We argue that this apparent mismatch between the phonology and phonetics can be expressed straightforwardly in the Articulatory Phonology framework by allowing tone gestures to coordinate with other gestures in all the ways that segmental gestures can, rather than restricting tone to c-center coordination.


2020 ◽  
Author(s):  
Robin Karlin

Featural and gestural models of tone differ on the degree to which they include timing information in the representation. However, both assume some kind of simultaneity between tones and their tone-bearing units, where featural models emphasize the role of acoustic relationships and gestural models instead emphasize articulatory coordination. We present the results of two acoustic production studies on two dialects of Serbian, a lexical pitch accent language. In the Belgrade dialect, pitch accents are aligned relatively late in the tone-bearing unit, while in the Valjevo dialect, pitch accents are phonetically retracted, sometimes into the preceding syllable. We varied the syllable onsets of tone-bearing units in falling (experiment 1) and rising (experiment 2) pitch accents, and measured the effects on F0 contours. Despite these differences in phonetic alignment, the phonological system is the same in both dialects. We argue that this apparent mismatch between the phonology and phonetics can be expressed straightforwardly in the Articulatory Phonology framework by allowing tone gestures to coordinate with other gestures in all the ways that segmental gestures can, rather than restricting tone to c-center coordination.


2016 ◽  
Vol 60 (2) ◽  
pp. 260-288 ◽  
Author(s):  
Frank Kügler ◽  
Caroline Féry

This article is a follow-up study of Féry and Kügler (2008. Pitch accent scaling on given, new and focused constituents in German. Journal of Phonetics, 36, 680–703). It reports on an experiment of the F0 height of potential pitch accents in the postfocal region of German sentences and addresses in this way an aspect of the influence of information structure on the intonation of sentences that was left open in the previous article. The results of the experiment showed that, when several constituents are located in this position, they are often in a downstep relation, but are rarely upstepped. In 37% of the cases, the pitch accents are only realized dynamically and there is no down- or upstepping. We interpret these results as evidence that postfocal constituents are phrased independently. The data examined speak against a model of postfocal intonation in which postfocal phrasing is eliminated and all accents are reduced to zero. Instead, the pitch accents are often present, although reduced. Moreover, the facts support the existence of prosodic phrasing of the postfocal constituents; the postfocal position implies an extremely compressed register, but no dephrasing or systematic complete deaccentuation of all pitch accents. We propose adopting a model of German intonation in which prosodic phrasing is determined by syntactic structure and cannot be changed by information structure. The role of information structure in prosody is limited to changes in the register relationship of the different parts of the sentence. Prefocally, there is no or only little register compression because of givenness. Postfocally, register compression is the rule. A model of intonation must take this asymmetry into account.


2017 ◽  
Vol 61 (3) ◽  
pp. 480-512 ◽  
Author(s):  
Jesse A. Harris ◽  
Katy Carlson

We compare the roles of overt accent and default focus marking in processing ellipsis structures headed by focus-sensitive coordinators (such as Danielle couldn’t pass the quiz, let alone the final/Kayla). In a small auditory corpus study of radio transcripts, we establish that such structures overwhelmingly occur with contrastive pitch accents on the correlate and remnant ( the quiz and the final, or Danielle and Kayla), and that there is a strong bias to pair the remnant with the most local plausible correlate in production. In two auditory naturalness ratings experiments, we observe that marking a non-local correlate with contrastive pitch accent moderates, but does not fully overturn, the bias for local correlates in comprehension. We propose that the locality preference is due to a sentence-final default position for sentence accent, and that auditory processing is subject to “enduring focus,” in which default positions for focus continue to influence the focus structure of the sentence even in the presence of overt accents. The importance of these results for models of auditory processing and of the processing of remnants in ellipsis structures is discussed.


2018 ◽  
Author(s):  
Simon Roessig ◽  
Doris Mücke ◽  
Martine Grice

The framework of dynamical systems offers powerful tools to understand the relation between stability and variability in human cognition in general and in speech in particular. In the current paper, we propose a dynamical systems approach to the description of German nuclear pitch accents in focus marking to account for both the categorical as well as the continuous variation found in intonational data. We report on results from 27 native speakers and employ an attractor landscape to represent pitch accent types in terms of f0 measures in a continuous dimension. We demonstrate how the same system can account for both the categorical variation (relative stability of one prosodic category) as well as the continuous variation (detailed modifications within one prosodic category). The model is able to capture the qualitative aspects of focus marking such as falling vs. rising pitch accent types as well as the quantitative aspects such as less rising vs. more rising accents in one system by means of scaling a single parameter. Furthermore, speaker group specific strategies are analysed and modelled as differences in the scaling of this parameter. Thus, the model contributes to the ongoing debate about the relation between phonetics and phonology and the importance of variation in language and speech.


Author(s):  
Utkarsh Kumar ◽  
Anil Kumar Gope ◽  
Shweta Singh

In India, the position of mobile banking was in saga and this time, it is in pic position. The speedof reaching the people is going high and high. This is time of wireless world and sense of prestige; no doubt the mobile commerce is contributing to enhance the beauty of life and playing the role of metaphor and has become the part and parcel of our life. This growth has changed people to do business in mobile commerce (М- Commerce). Peoples are transferring to M-Commerce to attain good and fast transaction into market and saving their precious time. M-Commerce has become distinguished in Indian people, quickly during last few years. Due to large number of mobile application, growth rate in mobile penetration in India is increasing with the rapid speed. The mobile users has shifted to use the android phone from simple and black and white phone and taking the service of internet, the role of telecom companies is also important in the being popular of mobile commerce. Although many people have started E-Commerce but still a separate part of the society feel uncomfortable and hesitate to use M-Commerce because of security problems, payment issues and complexity of mobile applications. This paper identifies facts about the feasibility of MCommercein India today its growth and the Strength and opportunity, weakness and threats lying ahead.


2021 ◽  
Vol 56 (1) ◽  
pp. 18-33
Author(s):  
Lucky Mathebe

After almost 25 years of what could justifiably be called transformative change in South Africa, a truism is that the country’s new legal order, established by the Constitution in 1993 and 1996, provides the critical foundation of peace and security upon which its freedom has been built. The Constitutional Court was one of the most important of the new democratic institutions in the shaping of the country’s position as a constitutional democracy, upholding the values for which millions of people, black and white, had fought. This article is a brief reflection on the role of the Court in establishing the meaning of this democracy and giving it effect. The main goal of the article is to understand how the Court’s new jurisprudence works in particular contexts, how its work is related to crime and punishment, and what it means for the rights of marginalised groups in society. Using the examples of the Court’s decision in Makwanyane on the death penalty, and the Court’s decision on the findings of the Public Protector’s report on Nkandla, the article finds that the Court’s new jurisprudence takes quite a different view of legal developments in South Africa, insofar as the jurisprudence entrusts broad discretion to the Court and emphasises the need for sustained leadership of the Court to advance the battle for fundamental human rights, the rule of law, and democratic accountability.


Author(s):  
Anouschka Foltz

Abstract While monolingual speakers can use contrastive pitch accents to predict upcoming referents, bilingual speakers do not always use this cue predictively in their L2. The current study examines the role of recent exposure for predictive processing in native German (L1) second language learners of English (L2). In Experiment 1, participants followed instructions to click on two successive objects, for example, Click on the red carrot/duck. Click on the green/GREEN carrot (where CAPS indicate a contrastive L + H* accent). Participants predicted a repeated noun following a L + H* accent in the L1, but not in the L2, where processing was delayed. Experiment 2 shows that after an exposure period with highly consistent prosodic cues, bilinguals engaged in predictive processing in both their L1 and L2. However, inconsistent prosodic cues showed different effects on bilinguals’ L1 and L2 predictive processing. The results are discussed in terms of exposure-based and resource-deficit models of processing.


Sign in / Sign up

Export Citation Format

Share Document