Using Complexity-Identical Human- and Machine-Directed Utterances to Investigate Addressee Detection for Spoken Dialogue Systems

Oleg Akhtiamov; Ingo Siegert; Alexey Karpov; Wolfgang Minker

doi:10.3390/s20092740

Using Complexity-Identical Human- and Machine-Directed Utterances to Investigate Addressee Detection for Spoken Dialogue Systems

Sensors ◽

10.3390/s20092740 ◽

2020 ◽

Vol 20 (9) ◽

pp. 2740 ◽

Cited By ~ 1

Author(s):

Oleg Akhtiamov ◽

Ingo Siegert ◽

Alexey Karpov ◽

Wolfgang Minker

Keyword(s):

Data Augmentation ◽

Human Being ◽

Dialogue Systems ◽

Dialogue System ◽

Spoken Dialogue Systems ◽

Spoken Dialogue ◽

Spoken Dialogue System ◽

Phone Calls ◽

Factors Influencing ◽

Using Data

Human-machine addressee detection (H-M AD) is a modern paralinguistics and dialogue challenge that arises in multiparty conversations between several people and a spoken dialogue system (SDS) since the users may also talk to each other and even to themselves while interacting with the system. The SDS is supposed to determine whether it is being addressed or not. All existing studies on acoustic H-M AD were conducted on corpora designed in such a way that a human addressee and a machine played different dialogue roles. This peculiarity influences speakers’ behaviour and increases vocal differences between human- and machine-directed utterances. In the present study, we consider the Restaurant Booking Corpus (RBC) that consists of complexity-identical human- and machine-directed phone calls and allows us to eliminate most of the factors influencing speakers’ behaviour implicitly. The only remaining factor is the speakers’ explicit awareness of their interlocutor (technical system or human being). Although complexity-identical H-M AD is essentially more challenging than the classical one, we managed to achieve significant improvements using data augmentation (unweighted average recall (UAR) = 0.628) over native listeners (UAR = 0.596) and a baseline classifier presented by the RBC developers (UAR = 0.539).

Get full-text (via PubEx)

Characterizing and Predicting Corrections in Spoken Dialogue Systems

Computational Linguistics ◽

10.1162/coli.2006.32.3.417 ◽

2006 ◽

Vol 32 (3) ◽

pp. 417-438 ◽

Cited By ~ 19

Author(s):

Diane Litman ◽

Julia Hirschberg ◽

Marc Swerts

Keyword(s):

Speech Recognition ◽

Predictive Power ◽

Classification Error ◽

Dialogue Systems ◽

Dialogue System ◽

Spoken Dialogue Systems ◽

Experimental Conditions ◽

Spoken Dialogue ◽

Spoken Dialogue System ◽

Recognition Errors

This article focuses on the analysis and prediction of corrections, defined as turns where a user tries to correct a prior error made by a spoken dialogue system. We describe our labeling procedure of various corrections types and statistical analyses of their features in a corpus collected from a train information spoken dialogue system. We then present results of machine-learning experiments designed to identify user corrections of speech recognition errors. We investigate the predictive power of features automatically computable from the prosody of the turn, the speech recognition process, experimental conditions, and the dialogue history. Our best-performing features reduce classification error from baselines of 25.70–28.99% to 15.72%.

Get full-text (via PubEx)

Spoken Dialogue System: Its Applications in the Developing Countries and a Technology for Bridging the Digital Divide and Augmenting Scarce Services in those Countries

Asian Journal of Research in Computer Science ◽

10.9734/ajrcos/2019/v4i230110 ◽

2019 ◽

pp. 1-11

Author(s):

Oyelami Olufemi Moses

Keyword(s):

Developing Countries ◽

Digital Divide ◽

Developing World ◽

Developing Nations ◽

Dialogue Systems ◽

Dialogue System ◽

Spoken Dialogue Systems ◽

Spoken Dialogue ◽

Spoken Dialogue System ◽

The World

Aims: This article reports the various application areas of the spoken dialogue system in the developing world to determine if the system could be used to bridge the digital divide prevalent in these regions of the world. The work also aims to identify in which developing nations is the system currently being put to use. Study Design: A survey of twenty articles on the subject matter was carried out and their domains of the application were identified. The different forms of the evaluation carried out on them were also identified towards determining their outcomes positivity for bridging the digital divide. Various comments made of the different evaluations were also considered in determining the suitability of spoken dialogue systems in bridging the digital divide. Place and Duration of Study: Department of Computer Science and Information Technology, Bowen University, Iwo, Nigeria, between February 2013 and October 2019. Methodology: The different domains of the works, the different forms of the evaluation carried out on the systems, the various comments consequent upon the testing of the systems by the participants and the developing countries where those works were carried out were identified. A position was now taken based on the results obtained. Results: Nine of the works are in the healthcare domain, three in agriculture, one in banking, one in aviation, one in secretarial work, one in the accuracy of recognition, one in education and three having multiple domains. The various comments and results from the evaluations all point towards the system’s suitability for bridging the digital divide. The spoken dialogue system is currently being used in only six developing nations of the world. Conclusion: Consequent upon the results obtained, it is clear that spoken dialogue systems can be used to bridge the digital divide in the developing world and that other application areas not yet covered could be explored for the benefits of the citizens of these regions, especially the digitally disadvantaged ones.

Get full-text (via PubEx)

‘Can I Trust the Spoken Dialogue System Because It Uses the Same Words as I Do?’—Influence of Lexically Aligned Spoken Dialogue Systems on Trustworthiness and User Satisfaction

Interacting with Computers ◽

10.1093/iwc/iwy005 ◽

2018 ◽

Vol 30 (3) ◽

pp. 173-186 ◽

Cited By ~ 1

Author(s):

Gesa Alena Linnemann ◽

Regina Jucks

Keyword(s):

User Satisfaction ◽

Dialogue Systems ◽

Dialogue System ◽

Spoken Dialogue Systems ◽

Spoken Dialogue ◽

Spoken Dialogue System

Get full-text (via PubEx)

Robust grammatical analysis for spoken dialogue systems

Natural Language Engineering ◽

10.1017/s1351324999002156 ◽

1999 ◽

Vol 5 (1) ◽

pp. 45-93 ◽

Cited By ~ 12

Author(s):

GERTJAN VAN NOORD ◽

GOSSE BOUMA ◽

ROB KOELING ◽

MARK-JAN NEDERHOF

Keyword(s):

Viable Alternative ◽

Dialogue Systems ◽

Test Results ◽

Sources Of Information ◽

Dialogue System ◽

Spoken Dialogue Systems ◽

Spoken Dialogue ◽

Spoken Dialogue System ◽

Grammatical Analysis ◽

Robust Parsing

We argue that grammatical analysis is a viable alternative to concept spotting for processing spoken input in a practical spoken dialogue system. We discuss the structure of the grammar, and a model for robust parsing which combines linguistic sources of information and statistical sources of information. We discuss test results suggesting that grammatical processing allows fast and accurate processing of spoken input.

Get full-text (via PubEx)

Automatically Training a Problematic Dialogue Predictor for a Spoken Dialogue System

Journal of Artificial Intelligence Research ◽

10.1613/jair.971 ◽

2002 ◽

Vol 16 ◽

pp. 293-319 ◽

Cited By ~ 29

Author(s):

M. A. Walker ◽

I. Langkilde-Geary ◽

H. Wright Hastie ◽

J. Wright ◽

A. Gorin

Keyword(s):

Information Sources ◽

Dialogue Systems ◽

Dialogue System ◽

Spoken Dialogue Systems ◽

Spoken Dialogue ◽

Spoken Dialogue System ◽

Customer Care ◽

Dialogue Manager

Spoken dialogue systems promise efficient and natural access to a large variety of information sources and services from any phone. However, current spoken dialogue systems are deficient in their strategies for preventing, identifying and repairing problems that arise in the conversation. This paper reports results on automatically training a Problematic Dialogue Predictor to predict problematic human-computer dialogues using a corpus of 4692 dialogues collected with the 'How May I Help You' (SM) spoken dialogue system. The Problematic Dialogue Predictor can be immediately applied to the system's decision of whether to transfer the call to a human customer care agent, or be used as a cue to the system's dialogue manager to modify its behavior to repair problems, and even perhaps, to prevent them. We show that a Problematic Dialogue Predictor using automatically-obtainable features from the first two exchanges in the dialogue can predict problematic dialogues 13.2% more accurately than the baseline.

Get full-text (via PubEx)

Design and Development of an Automated Voice Agent

Conversational Agents and Natural Language Interaction ◽

10.4018/978-1-60960-617-6.ch015 ◽

2011 ◽

pp. 335-357

Author(s):

Pepi Stavropoulou ◽

Dimitris Spiliotopoulos ◽

Georgios Kouroupetroglou

Keyword(s):

Customer Service ◽

Real Life ◽

Dialogue Systems ◽

Dialogue System ◽

Design And Development ◽

Spoken Dialogue Systems ◽

Spoken Dialogue ◽

Spoken Dialogue System ◽

User Input ◽

Dialogue Structure

Sophisticated, commercially deployed spoken dialogue systems capable of engaging in more natural human-machine conversation have increased in number over the past years. Besides employing advanced interpretation and dialogue management technologies, the success of such systems greatly depends on effective design and development methodology. There is, actually, a widely acknowledged, fundamentally reciprocal relationship between technologies used and design choices. In this line of thought, this chapter constitutes a more practical approach to spoken dialogue system development, comparing design methods and implementation tools highly suited for industry oriented spoken dialogue systems, and commenting on their interdependencies, in order to facilitate the developer’s choice of the optimal tools and methodologies. The latter are presented and assessed in the light of AVA, a real-life Automated Voice Agent that performs call routing and customer service tasks, employing advanced stochastic techniques for interpretation and allowing for free form user input and less rigid dialogue structure.

Get full-text (via PubEx)

Student Evaluations of a (Rude) Spoken Dialogue System Insights from an Experimental Study

Advances in Human-Computer Interaction ◽

10.1155/2018/8406187 ◽

2018 ◽

Vol 2018 ◽

pp. 1-10

Author(s):

Regina Jucks ◽

Gesa A. Linnemann ◽

Benjamin Brummernhenrich

Keyword(s):

Experimental Study ◽

Young Adults ◽

Student Evaluations ◽

Dialogue Systems ◽

Dialogue System ◽

Spoken Dialogue Systems ◽

Spoken Dialogue ◽

Spoken Dialogue System ◽

Word Use ◽

Practical Implications

Communicating with spoken dialogue systems (SDS) such as Apple’s Siri® and Google’s Now is becoming more and more common. We report a study that manipulates an SDS’s word use with regard to politeness. In an experiment, 58 young adults evaluated the spoken messages of our self-developed SDS as it replied to typical questions posed by university freshmen. The answers were either formulated politely or rudely. Dependent measures were both holistic measures of how students perceived the SDS as well as detailed evaluations of each single answer. Results show that participants not only evaluated the content of rude answers as being less appropriate and less pleasant than the polite answers, but also evaluated the rude system as less accurate. Lack of politeness also impacted aspects of the perceived trustworthiness of the SDS. We conclude that users of SDS expect such systems to be polite, and we then discuss some practical implications for designing SDS.

Get full-text (via PubEx)

A Neural Network Approach to Intention Modeling for User-Adapted Conversational Agents

Computational Intelligence and Neuroscience ◽

10.1155/2016/8402127 ◽

2016 ◽

Vol 2016 ◽

pp. 1-11

Author(s):

David Griol ◽

Zoraida Callejas

Keyword(s):

Positive Influence ◽

Conversational Agents ◽

Dialogue System ◽

Spoken Dialogue Systems ◽

Neural Network Approach ◽

Spoken Dialogue ◽

Spoken Dialogue System ◽

Dialogue Model ◽

Computer Interfaces ◽

Human Computer Interfaces

Spoken dialogue systems have been proposed to enable a more natural and intuitive interaction with the environment and human-computer interfaces. In this contribution, we present a framework based on neural networks that allows modeling of the user’s intention during the dialogue and uses this prediction to dynamically adapt the dialogue model of the system taking into consideration the user’s needs and preferences. We have evaluated our proposal to develop a user-adapted spoken dialogue system that facilitates tourist information and services and provide a detailed discussion of the positive influence of our proposal in the success of the interaction, the information and services provided, and the quality perceived by the users.

Get full-text (via PubEx)

An Application of Reinforcement Learning to Dialogue Strategy Selection in a Spoken Dialogue System for Email

Journal of Artificial Intelligence Research ◽

10.1613/jair.713 ◽

2000 ◽

Vol 12 ◽

pp. 387-416 ◽

Cited By ~ 56

Author(s):

M. A. Walker

Keyword(s):

Reinforcement Learning ◽

Performance Modeling ◽

Evaluation Framework ◽

Interactive System ◽

Strategy Selection ◽

Dialogue System ◽

Spoken Dialogue Systems ◽

Spoken Dialogue ◽

Spoken Dialogue System ◽

And Performance

This paper describes a novel method by which a spoken dialogue system can learn to choose an optimal dialogue strategy from its experience interacting with human users. The method is based on a combination of reinforcement learning and performance modeling of spoken dialogue systems. The reinforcement learning component applies Q-learning (Watkins, 1989), while the performance modeling component applies the PARADISE evaluation framework (Walker et al., 1997) to learn the performance function (reward) used in reinforcement learning. We illustrate the method with a spoken dialogue system named ELVIS (EmaiL Voice Interactive System), that supports access to email over the phone. We conduct a set of experiments for training an optimal dialogue strategy on a corpus of 219 dialogues in which human users interact with ELVIS over the phone. We then test that strategy on a corpus of 18 dialogues. We show that ELVIS can learn to optimize its strategy selection for agent initiative, for reading messages, and for summarizing email folders.

Get full-text (via PubEx)

SemEval-2014 Task 2: Grammar Induction for Spoken Dialogue Systems

10.3115/v1/s14-2002 ◽

2014 ◽

Author(s):

Ioannis Klasinas ◽

Elias Iosif ◽

Katerina Louka ◽

Alexandros Potamianos

Keyword(s):

Dialogue Systems ◽

Grammar Induction ◽

Spoken Dialogue Systems ◽

Spoken Dialogue

Get full-text (via PubEx)