Real-Time Language Processing as Embodied and Embedded in Joint Action

Putting up barriers

10.1093/oso/9780198785156.003.0001 ◽

2017 ◽

Author(s):

David J. Lobina

Keyword(s):

Knowledge Base ◽

Real Time ◽

Language Processing ◽

Cognitive Domain ◽

Linguistic Knowledge ◽

Levels Of Explanation ◽

Real Time Processing ◽

Time Processing ◽

Neural Underpinnings ◽

Different Levels

The study of cognitive phenomena is best approached in an orderly manner. It must begin with an analysis of the function in intension at the heart of any cognitive domain (its knowledge base), then proceed to the manner in which such knowledge is put into use in real-time processing, concluding with a domain’s neural underpinnings, its development in ontogeny, etc. Such an approach to the study of cognition involves the adoption of different levels of explanation/description, as prescribed by David Marr and many others, each level requiring its own methodology and supplying its own data to be accounted for. The study of recursion in cognition is badly in need of a systematic and well-ordered approach, and this chapter lays out the blueprint to be followed in the book by focusing on a strict separation between how this notion applies in linguistic knowledge and how it manifests itself in language processing.

Download Full-text

Limits on expectation-based processing: Use of grammatical aspect for co-reference in L2

Applied Psycholinguistics ◽

10.1017/s0142716420000582 ◽

2020 ◽

pp. 1-25

Author(s):

Theres Grüter ◽

Hannah Rohde

Keyword(s):

Real Time ◽

Language Processing ◽

Judgment Task ◽

Linguistic Knowledge ◽

Real Time Processing ◽

Time Processing ◽

Grammatical Aspect ◽

Level Information ◽

Native Speakers Of English ◽

Reference Processing

Abstract This study examines the use of discourse-level information to create expectations about reference in real-time processing, testing whether patterns previously observed among native speakers of English generalize to nonnative speakers. Findings from a visual-world eye-tracking experiment show that native (L1; N = 53) but not nonnative (L2; N = 52) listeners’ proactive coreference expectations are modulated by grammatical aspect in transfer-of-possession events. Results from an offline judgment task show these L2 participants did not differ from L1 speakers in their interpretation of aspect marking on transfer-of-possession predicates in English, indicating it is not lack of linguistic knowledge but utilization of this knowledge in real-time processing that distinguishes the groups. English proficiency, although varying substantially within the L2 group, did not modulate L2 listeners’ use of grammatical aspect for reference processing. These findings contribute to the broader endeavor of delineating the role of prediction in human language processing in general, and in the processing of discourse-level information among L2 users in particular.

Download Full-text

What's my App?

ACM SIGMETRICS Performance Evaluation Review ◽

10.1145/3466826.3466841 ◽

2021 ◽

Vol 48 (4) ◽

pp. 41-44

Author(s):

Dena Markudova ◽

Martino Trevisan ◽

Paolo Garza ◽

Michela Meo ◽

Maurizio M. Munafo ◽

...

Keyword(s):

Machine Learning ◽

Natural Language Processing ◽

Real Time ◽

Language Processing ◽

Traffic Management ◽

Quality Of Experience ◽

Broadband Internet ◽

Management Policies ◽

Control Traffic

With the spread of broadband Internet, Real-Time Communication (RTC) platforms have become increasingly popular and have transformed the way people communicate. Thus, it is fundamental that the network adopts traffic management policies that ensure appropriate Quality of Experience to users of RTC applications. A key step for this is the identification of the applications behind RTC traffic, which in turn allows to allocate adequate resources and make decisions based on the specific application's requirements. In this paper, we introduce a machine learning-based system for identifying the traffic of RTC applications. It builds on the domains contacted before starting a call and leverages techniques from Natural Language Processing (NLP) to build meaningful features. Our system works in real-time and is robust to the peculiarities of the RTP implementations of different applications, since it uses only control traffic. Experimental results show that our approach classifies 5 well-known meeting applications with an F1 score of 0.89.

Download Full-text

A guided latent Dirichlet allocation approach to investigate real-time latent topics of Twitter data during Hurricane Laura

Journal of Information Science ◽

10.1177/01655515211007724 ◽

2021 ◽

pp. 016555152110077

Author(s):

Sulong Zhou ◽

Pengyu Kan ◽

Qunying Huang ◽

Janet Silbernagel

Keyword(s):

Social Media ◽

Real Time ◽

Language Processing ◽

Disaster Response ◽

Domain Knowledge ◽

Latent Dirichlet Allocation ◽

Situational Awareness ◽

High Performing ◽

Latent Topics ◽

Dirichlet Allocation

Natural disasters cause significant damage, casualties and economical losses. Twitter has been used to support prompt disaster response and management because people tend to communicate and spread information on public social media platforms during disaster events. To retrieve real-time situational awareness (SA) information from tweets, the most effective way to mine text is using natural language processing (NLP). Among the advanced NLP models, the supervised approach can classify tweets into different categories to gain insight and leverage useful SA information from social media data. However, high-performing supervised models require domain knowledge to specify categories and involve costly labelling tasks. This research proposes a guided latent Dirichlet allocation (LDA) workflow to investigate temporal latent topics from tweets during a recent disaster event, the 2020 Hurricane Laura. With integration of prior knowledge, a coherence model, LDA topics visualisation and validation from official reports, our guided approach reveals that most tweets contain several latent topics during the 10-day period of Hurricane Laura. This result indicates that state-of-the-art supervised models have not fully utilised tweet information because they only assign each tweet a single label. In contrast, our model can not only identify emerging topics during different disaster events but also provides multilabel references to the classification schema. In addition, our results can help to quickly identify and extract SA information to responders, stakeholders and the general public so that they can adopt timely responsive strategies and wisely allocate resource during Hurricane events.

Download Full-text

The CoRisk-Index: Measuring economic risks related to COVID-19 in real time

10.21203/rs.3.rs-81992/v1 ◽

2021 ◽

Author(s):

Fabian Braesemann ◽

Fabian Stephany ◽

Leonie Neuhäuser ◽

Niklas Stoehr ◽

Philipp Darius ◽

...

Keyword(s):

Stock Market ◽

Real Time ◽

Language Processing ◽

Economic Indicator ◽

Economic Consequences ◽

Economic Downturn ◽

Related Risk ◽

Unemployment Data ◽

Using Data ◽

Processing Techniques

Abstract The global spread of Covid-19 has caused major economic disruptions. Governments around the world provide considerable financial support to mitigate the economic downturn. However, effective policy responses require reliable data on the economic consequences of the corona pandemic. We propose the CoRisk-Index: a real-time economic indicator of Covid-19 related risk assessments by industry. Using data mining, we analyse all reports from US companies filed since January 2020, representing more than a third of all US employees. We construct two measures - the number of 'corona' words in each report and the average text negativity of the sentences mentioning corona in each industry - that are aggregated in the CoRisk-Index. The index correlates with U.S. unemployment data and preempts stock market losses of February 2020. Moreover, thanks to topic modelling and natural language processing techniques, the CoRisk data provides unique granularity with regards to the particular contexts of the crisis and the concerns of individual industries about them. The data presented here help researchers and decision makers to measure, the previously unobserved, risk awareness of industries with regard to Covid-19, bridging the quantification gap between highly volatile stock market dynamics and long-term macro-economic figures. For immediate access to the data, we provide all findings and raw data on an interactive online dashboard in real time.

Download Full-text

Early Identification of Patients with Acute Gastrointestinal Bleeding using Electronic Health Record Phenotyping

10.1101/2020.07.06.20136374 ◽

2020 ◽

Author(s):

Dennis Shung ◽

Cynthia Tsay ◽

Loren Laine ◽

Prem Thomas ◽

Caitlin Partridge ◽

...

Keyword(s):

Risk Stratification ◽

Gastrointestinal Bleeding ◽

Real Time ◽

Decision Rule ◽

Language Processing ◽

Clinical Decision Making ◽

External Validation ◽

Clinical Decision ◽

Internal Validation ◽

Electronic Health

Background and AimGuidelines recommend risk stratification scores in patients presenting with gastrointestinal bleeding (GIB), but such scores are uncommonly employed in practice. Automation and deployment of risk stratification scores in real time within electronic health records (EHRs) would overcome a major impediment. This requires an automated mechanism to accurately identify (“phenotype”) patients with GIB at the time of presentation. The goal is to identify patients with acute GIB by developing and evaluating EHR-based phenotyping algorithms for emergency department (ED) patients.MethodsWe specified criteria using structured data elements to create rules for identifying patients, and also developed a natural-language-processing (NLP)-based algorithm for automated phenotyping of patients, tested them with tenfold cross-validation (n=7144) and external validation (n=2988), and compared them with the standard method for encoding patient conditions in the EHR, Systematized Nomenclature of Medicine (SNOMED). The gold standard for GIB diagnosis was independent dual manual review of medical records. The primary outcome was positive predictive value (PPV).ResultsA decision rule using GIB-specific terms from ED triage and from ED review-of-systems assessment performed better than SNOMED on internal validation (PPV=91% [90%-93%] vs. 74% [71%-76%], P<0.001) and external validation (PPV=85% [84%-87%] vs. 69% [67%-71%], P<0.001). The NLP algorithm (external validation PPV=80% [79-82%]) was not superior to the structured-datafields decision rule.ConclusionsAn automated decision rule employing GIB-specific triage and review-of-systems terms can be used to trigger EHR-based deployment of risk stratification models to guide clinical decision-making in real time for patients with acute GIB presenting to the ED.

Download Full-text

Dynamic Engagement of Cognitive Control Modulates Recovery From Misinterpretation During Real-Time Language Processing

Psychological Science ◽

10.1177/0956797615625223 ◽

2016 ◽

Vol 27 (4) ◽

pp. 572-582 ◽

Cited By ~ 37

Author(s):

Nina S. Hsu ◽

Jared M. Novick

Keyword(s):

Cognitive Control ◽

Real Time ◽

Language Processing

Download Full-text

Wave2Vec: Vectorizing Electroencephalography Bio-Signal for Prediction of Brain Disease

International Journal of Environmental Research and Public Health ◽

10.3390/ijerph15081750 ◽

2018 ◽

Vol 15 (8) ◽

pp. 1750 ◽

Cited By ~ 4

Author(s):

Seonho Kim ◽

Jungjoon Kim ◽

Hong-Woo Chun

Keyword(s):

Artificial Intelligence ◽

Time Series ◽

Feature Selection ◽

Deep Learning ◽

Natural Language Processing ◽

Data Analysis ◽

Natural Language ◽

Real Number ◽

Real Time ◽

Language Processing

Interest in research involving health-medical information analysis based on artificial intelligence, especially for deep learning techniques, has recently been increasing. Most of the research in this field has been focused on searching for new knowledge for predicting and diagnosing disease by revealing the relation between disease and various information features of data. These features are extracted by analyzing various clinical pathology data, such as EHR (electronic health records), and academic literature using the techniques of data analysis, natural language processing, etc. However, still needed are more research and interest in applying the latest advanced artificial intelligence-based data analysis technique to bio-signal data, which are continuous physiological records, such as EEG (electroencephalography) and ECG (electrocardiogram). Unlike the other types of data, applying deep learning to bio-signal data, which is in the form of time series of real numbers, has many issues that need to be resolved in preprocessing, learning, and analysis. Such issues include leaving feature selection, learning parts that are black boxes, difficulties in recognizing and identifying effective features, high computational complexities, etc. In this paper, to solve these issues, we provide an encoding-based Wave2vec time series classifier model, which combines signal-processing and deep learning-based natural language processing techniques. To demonstrate its advantages, we provide the results of three experiments conducted with EEG data of the University of California Irvine, which are a real-world benchmark bio-signal dataset. After converting the bio-signals (in the form of waves), which are a real number time series, into a sequence of symbols or a sequence of wavelet patterns that are converted into symbols, through encoding, the proposed model vectorizes the symbols by learning the sequence using deep learning-based natural language processing. The models of each class can be constructed through learning from the vectorized wavelet patterns and training data. The implemented models can be used for prediction and diagnosis of diseases by classifying the new data. The proposed method enhanced data readability and intuition of feature selection and learning processes by converting the time series of real number data into sequences of symbols. In addition, it facilitates intuitive and easy recognition, and identification of influential patterns. Furthermore, real-time large-capacity data analysis is facilitated, which is essential in the development of real-time analysis diagnosis systems, by drastically reducing the complexity of calculation without deterioration of analysis performance by data simplification through the encoding process.

Download Full-text

Be Relevant, Non-Redundant, and Timely: Deep Reinforcement Learning for Real-Time Event Summarization

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v34i05.6483 ◽

2020 ◽

Vol 34 (05) ◽

pp. 9410-9417

Author(s):

Min Yang ◽

Chengming Li ◽

Fei Sun ◽

Zhou Zhao ◽

Ying Shen ◽

...

Keyword(s):

Reinforcement Learning ◽

Real Time ◽

Language Processing ◽

Semantic Features ◽

Unified Framework ◽

Time Step ◽

Dynamic Memory ◽

The Arts ◽

Real World Datasets ◽

Event Summarization

Real-time event summarization is an essential task in natural language processing and information retrieval areas. Despite the progress of previous work, generating relevant, non-redundant, and timely event summaries remains challenging in practice. In this paper, we propose a Deep Reinforcement learning framework for real-time Event Summarization (DRES), which shows promising performance for resolving all three challenges (i.e., relevance, non-redundancy, timeliness) in a unified framework. Specifically, we (i) devise a hierarchical cross-attention network with intra- and inter-document attentions to integrate important semantic features within and between the query and input document for better text matching. In addition, relevance prediction is leveraged as an auxiliary task to strengthen the document modeling and help to extract relevant documents; (ii) propose a multi-topic dynamic memory network to capture the sequential patterns of different topics belonging to the event of interest and temporally memorize the input facts from the evolving document stream, avoiding extracting redundant information at each time step; (iii) consider both historical dependencies and future uncertainty of the document stream for generating relevant and timely summaries by exploiting the reinforcement learning technique. Experimental results on two real-world datasets have demonstrated the advantages of DRES model with significant improvement in generating relevant, non-redundant, and timely event summaries against the state-of-the-arts.

Download Full-text