Natural language processing systems and Big Data analytics

Author(s):  
Lidong Wang ◽  
Guanghui Wang ◽  
Cheryl Ann Alexander
2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
Yun Kyung Oh ◽  
Jisu Yi

Purpose – The evaluation of perceived attribute performance reflected in online consumer reviews (OCRs) is critical for gaining timely marketing insights. This study proposes a text mining approach to measure consumer sentiment at the feature level and its asymmetric impact on overall product ratings.

Design/methodology/approach – This study employed 49,130 OCRs generated for 14 wireless earbud products on Amazon.com. Word combinations of the major quality dimensions and related sentiment words were identified using bigram natural language processing (NLP) analysis. The study combined sentiment dictionaries with feature-related bigrams and measured feature-level sentiment scores within a review. The authors then examined the effect of feature-level sentiment on product ratings.

Findings – The results indicate that customer sentiment for product features, measured from text reviews, significantly and asymmetrically affects the overall rating. Building on the three-factor theory of customer satisfaction, the key quality dimensions of wireless earbuds are categorized into basic, excitement and performance factors.

Originality/value – This study provides a novel approach to assessing the customer's feature-level evaluation of a product and its impact on customer satisfaction based on big data analytics. By applying the suggested methodology, marketing managers can gain in-depth insight into consumer needs and reflect this knowledge in future product or service improvements.
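The bigram-based, feature-level sentiment scoring the abstract describes can be sketched roughly as follows. The lexicon, the feature list and the function names below are illustrative assumptions only; the study combines full sentiment dictionaries with bigrams mined from the review corpus, not a hand-written word list.

```python
from collections import Counter

# Hypothetical mini sentiment lexicon and feature set (illustrative only;
# the study derives these from sentiment dictionaries and corpus bigrams).
SENTIMENT = {"great": 1, "good": 1, "clear": 1, "poor": -1, "bad": -1, "weak": -1}
FEATURES = {"battery", "sound", "fit"}  # example earbud quality dimensions

def bigrams(tokens):
    """Return adjacent word pairs (bigrams) from a token list."""
    return list(zip(tokens, tokens[1:]))

def feature_sentiment(review):
    """Score each feature by summing the sentiment of words paired with it."""
    tokens = review.lower().split()
    scores = Counter()
    for w1, w2 in bigrams(tokens):
        if w1 in FEATURES and w2 in SENTIMENT:
            scores[w1] += SENTIMENT[w2]
        elif w2 in FEATURES and w1 in SENTIMENT:
            scores[w2] += SENTIMENT[w1]
    return dict(scores)
```

For example, `feature_sentiment("great sound but poor battery")` yields a positive score for `sound` and a negative one for `battery`; in the study, such per-feature scores are then regressed on the overall star rating to test for asymmetric effects.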


2021 ◽  
Vol 50 (2-3) ◽  
pp. 17-22
Author(s):  
Johannes Brunzel

This article explains how the method of quantitative text analysis can be a significant means of improving business efficiency. It goes beyond listing the opportunities and risks of using artificial intelligence and big data analytics by deriving, in a practice-oriented way, important developments in quantitative content analysis from the business and economics literature. The article then divides the main implementation steps into (1) collection of quantitative text data, (2) execution of generic text analysis and (3) execution of natural language processing. As a key result, the article finds that although natural language processing approaches offer deeper and more complex insights, the potential of generic text analysis has not yet been exhausted, owing to its flexibility and comparatively simple applicability in a corporate context. In addition, executives face the dichotomous decision of whether programming-based or commercial solutions are relevant for carrying out the text analysis.
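Step (2), generic text analysis, is typically a simple dictionary or frequency count over the collected documents. A minimal sketch, assuming a hand-chosen keyword list (the keywords and function name here are illustrative, not from the article):

```python
import re
from collections import Counter

def generic_text_analysis(docs, keywords):
    """Dictionary-based generic text analysis: count keyword hits per document."""
    results = []
    for doc in docs:
        # Lowercase and tokenize on letter runs (incl. German umlauts).
        tokens = re.findall(r"[a-zäöüß]+", doc.lower())
        counts = Counter(tokens)
        results.append({k: counts[k] for k in keywords})
    return results
```

This flexibility and ease of use is exactly the advantage the article attributes to generic text analysis over full NLP pipelines: the same few lines work on annual reports, press releases or customer emails without model training.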


2021 ◽  
Author(s):  
R. Salter ◽  
Quyen Dong ◽  
Cody Coleman ◽  
Maria Seale ◽  
Alicia Ruvinsky ◽  
...  

The Engineer Research and Development Center, Information Technology Laboratory’s (ERDC-ITL’s) Big Data Analytics team specializes in the analysis of large-scale datasets with capabilities across four research areas that require vast amounts of data to inform and drive analysis: large-scale data governance, deep learning and machine learning, natural language processing, and automated data labeling. Unfortunately, data transfer between government organizations is a complex and time-consuming process requiring coordination of multiple parties across multiple offices and organizations. Past successes in large-scale data analytics have placed a significant demand on ERDC-ITL researchers, highlighting that few individuals fully understand how to successfully transfer data between government organizations; future project success therefore depends on a small group of individuals to efficiently execute a complicated process. The Big Data Analytics team set out to develop a standardized workflow for the transfer of large-scale datasets to ERDC-ITL, in part to educate peers and future collaborators on the process required to transfer datasets between government organizations. Researchers also aim to increase workflow efficiency while protecting data integrity. This report provides an overview of the created Data Lake Ecosystem Workflow by focusing on the six phases required to efficiently transfer large datasets to supercomputing resources located at ERDC-ITL.


Author(s):  
Kanza Noor Syeda ◽  
Syed Noorulhassan Shirazi ◽  
Syed Asad Ali Naqvi ◽  
Howard J Parkinson ◽  
Gary Bamford

Modern computing power, the explosion in data availability and advanced analytics create opportunities to use a Big Data approach to proactively identify high-risk scenarios on the railway. In this chapter, we examine the need for machine intelligence to identify heightened risk on the railway. In doing so, we explain the potential of a new data-driven approach for the railway, then focus the rest of the chapter on Natural Language Processing (NLP) and its potential for analysing accident data. We review and analyse investigation reports of railway accidents in the UK published by the Rail Accident Investigation Branch (RAIB), aiming to reveal entities that are informative of causes and failures, whether human, technical or external. We give an overview of a framework based on NLP and machine learning for analysing the raw text of RAIB reports, which would assist risk and incident analysis experts in studying the causal relationships between causes and failures and thereby improve overall safety in the rail industry.
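A first step toward the entity extraction described above can be sketched as simple category tagging over report text. The keyword lists and function name below are hypothetical placeholders; the chapter's actual framework uses NLP and machine learning rather than fixed keyword matching.

```python
# Hypothetical cause-category keywords (illustrative only; the framework
# in the chapter learns such associations from RAIB report text).
CAUSE_KEYWORDS = {
    "human": {"driver", "signaller", "fatigue", "error"},
    "technical": {"brake", "signal", "track", "failure"},
    "external": {"weather", "flooding", "trespass", "debris"},
}

def tag_causes(report_text):
    """Return the cause categories whose keywords appear in a report."""
    words = set(report_text.lower().split())
    return sorted(cat for cat, kws in CAUSE_KEYWORDS.items() if words & kws)
```

Tagging each report this way gives a coarse human/technical/external breakdown that an ML model could refine, e.g. by learning causal relationships between the tagged entities.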

