scholarly journals How Am I Doing?: Evaluating Conversational Search Systems Offline

2021 ◽  
Vol 39 (4) ◽  
pp. 1-22
Author(s):  
Aldo Lipani ◽  
Ben Carterette ◽  
Emine Yilmaz

As conversational agents like Siri and Alexa gain in popularity and use, conversation is becoming a more and more important mode of interaction for search. Conversational search shares some features with traditional search, but differs in some important respects: conversational search systems are less likely to return ranked lists of results (a SERP), more likely to involve iterated interactions, and more likely to feature longer, well-formed user queries in the form of natural language questions. Because of these differences, traditional methods for search evaluation (such as the Cranfield paradigm) do not translate easily to conversational search. In this work, we propose a framework for offline evaluation of conversational search, which includes a methodology for creating test collections with relevance judgments, an evaluation measure based on a user interaction model, and an approach to collecting user interaction data to train the model. The framework is based on the idea of “subtopics”, often used to model novelty and diversity in search and recommendation, and the user model is similar to the geometric browsing model introduced by RBP and used in ERR. As far as we know, this is the first work to combine these ideas into a comprehensive framework for offline evaluation of conversational search.

2007 ◽  
Vol 22 (4) ◽  
pp. 361-377 ◽  
Author(s):  
VICTORIA UREN ◽  
YUANGUI LEI ◽  
VANESSA LOPEZ ◽  
HAIMING LIU ◽  
ENRICO MOTTA ◽  
...  

AbstractThe goal of semantic search is to improve on traditional search methods by exploiting the semantic metadata. In this paper, we argue that supporting iterative and exploratory search modes is important to the usability of all search systems. We also identify the types of semantic queries the users need to make, the issues concerning the search environment and the problems that are intrinsic to semantic search in particular. We then review the four modes of user interaction in existing semantic search systems, namely keyword-based, form-based, view-based and natural language-based systems. Future development should focus on multimodal search systems, which exploit the advantages of more than one mode of interaction, and on developing the search systems that can search heterogeneous semantic metadata on the open semantic Web.


Author(s):  
Ronnie W. Smith ◽  
D. Richard Hipp

As spoken natural language dialog systems technology continues to make great strides, numerous issues regarding dialog processing still need to be resolved. This book presents an exciting new dialog processing architecture that allows for a number of behaviors required for effective human-machine interactions, including: problem-solving to help the user carry out a task, coherent subdialog movement during the problem-solving process, user model usage, expectation usage for contextual interpretation and error correction, and variable initiative behavior for interacting with users of differing expertise. The book also details how different dialog problems in processing can be handled simultaneously, and provides instructions and in-depth result from pertinent experiments. Researchers and professionals in natural language systems will find this important new book an invaluable addition to their libraries.


2021 ◽  
Vol 27 (1) ◽  
pp. 146045822199486
Author(s):  
Nicholas RJ Frick ◽  
Felix Brünker ◽  
Björn Ross ◽  
Stefan Stieglitz

Within the anamnesis, medical information is frequently withheld, incomplete, or incorrect, potentially causing negative consequences for the patient. The use of conversational agents (CAs), computer-based systems using natural language to interact with humans, may mitigate this problem. The present research examines whether CAs differ from physicians in their ability to elicit truthful disclosure and discourage concealment of medical information. We conducted an online questionnaire with German participants ( N = 148) to assess their willingness to reveal medical information. The results indicate that patients would rather disclose medical information to a physician than to a CA; there was no difference in the tendency to conceal information. This research offers a frame of reference for future research on applying CAs during the anamnesis to support physicians. From a practical view, physicians might gain better understanding of how the use of CAs can facilitate the anamnesis.


2017 ◽  
Vol 13 (4) ◽  
pp. 1989-1999 ◽  
Author(s):  
Fabrizio Lamberti ◽  
Gianluca Paravati ◽  
Valentina Gatteschi ◽  
Alberto Cannavo

2020 ◽  
Vol 10 (3) ◽  
pp. 762
Author(s):  
Erinc Merdivan ◽  
Deepika Singh ◽  
Sten Hanke ◽  
Johannes Kropf ◽  
Andreas Holzinger ◽  
...  

Conversational agents are gaining huge popularity in industrial applications such as digital assistants, chatbots, and particularly systems for natural language understanding (NLU). However, a major drawback is the unavailability of a common metric to evaluate the replies against human judgement for conversational agents. In this paper, we develop a benchmark dataset with human annotations and diverse replies that can be used to develop such metric for conversational agents. The paper introduces a high-quality human annotated movie dialogue dataset, HUMOD, that is developed from the Cornell movie dialogues dataset. This new dataset comprises 28,500 human responses from 9500 multi-turn dialogue history-reply pairs. Human responses include: (i) ratings of the dialogue reply in relevance to the dialogue history; and (ii) unique dialogue replies for each dialogue history from the users. Such unique dialogue replies enable researchers in evaluating their models against six unique human responses for each given history. Detailed analysis on how dialogues are structured and human perception on dialogue score in comparison with existing models are also presented.


Author(s):  
Daniel Scherer ◽  
Ademar V. Netto ◽  
Yuska P. C. Aguiar ◽  
Maria de Fátima Q. Vieira

In order to prevent human error, it is essential to understand the nature of the user’s behaviour. This chapter proposes a combined approach to increase knowledge of user behaviour by instantiating a programmable user model with data gathered from a user profile. Together, the user profile and user model represent, respectively, the static and dynamic characteristics of user behaviour. Typically, user models have been employed by system designers to explore the user decision-making process and its implications, since user profiles do not account for the dynamic aspects of a user interaction. In this chapter, the user profile and model are employed to study human errors—supporting an investigation of the relationship between user errors and user characteristics. The chapter reviews the literature on user profiles and models and presents the proposed user profile and model. It concludes by discussing the application of the proposed approach in the context of electrical systems’ operation.


2012 ◽  
pp. 969-985
Author(s):  
Floriana Esposito ◽  
Teresa M.A. Basile ◽  
Nicola Di Mauro ◽  
Stefano Ferilli

One of the most important features of a mobile device concerns its flexibility and capability to adapt the functionality it provides to the users. However, the main problems of the systems present in literature are their incapability to identify user needs and, more importantly, the insufficient mappings of those needs to available resources/services. In this paper, we present a two-phase construction of the user model: firstly, an initial static user model is built for the user connecting to the system the first time. Then, the model is revised/adjusted by considering the information collected in the logs of the user interaction with the device/context in order to make the model more adequate to the evolving user’s interests/ preferences/behaviour. The initial model is built by exploiting the stereotype concept, its adjustment is performed exploiting machine learning techniques and particularly, sequence mining and pattern discovery strategies.


Sign in / Sign up

Export Citation Format

Share Document