Ukrainian Text Preprocessing in GRAC

Abstract: Various recommender systems (RSs) have been developed over recent years, and many of them have concentrated on English content. Thus, the majority of RSs in the literature were compared on English content. However, research on RSs for content in other languages, such as Arabic, is minimal, and researchers still neglect the field of Arabic RSs. This study therefore aims to fill this research gap by leveraging recent advances in the English RSs field. Our main goal is to investigate recent RSs in an Arabic context. To that end, we first selected five state-of-the-art RSs originally devoted to English content, and then empirically evaluated their performance on Arabic content. As a result of this work, we first built four publicly available large-scale Arabic datasets for recommendation purposes. Second, we provide various text preprocessing techniques for preparing the constructed datasets. Third, our investigation derived well-argued conclusions about the use of modern RSs in the Arabic context. The experimental results showed that these systems achieve high performance when applied to Arabic content.

Compound Words as a Means of Expressing Content Tonality in the Ukrainian Text Media

10.1109/csit52700.2021.9648589; 2021

Author(s): Zoriana Haladzhun, Nataliia Kunanets, Paraskoviya Dvorianyn, Olena Makarchuk, Nataliia Veretennikova

Keyword(s): Compound Words, Ukrainian Text

Text Preprocessing

Practical Text Analytics - Advances in Analytics and Data Science; 10.1007/978-3-319-95663-3_4; 2018; pp. 45-59; Cited By ~ 2

Author(s): Murugan Anandarajan, Chelsey Hill, Thomas Nolan

Keyword(s): Text Preprocessing

Studying the Effects of Text Preprocessing and Ensemble Methods on Sentiment Analysis of Brazilian Portuguese Tweets

Statistical Language and Speech Processing - Lecture Notes in Computer Science; 10.1007/978-3-030-00810-9_15; 2018; pp. 167-177

Author(s): Fernando Barbosa Gomes, Juan Manuel Adán-Coello, Fernando Ernesto Kintschner

Keyword(s): Sentiment Analysis, Ensemble Methods, Brazilian Portuguese, Text Preprocessing

Method for Determining Linguometric Coefficient Dynamics of Ukrainian Text Content Authorship

Advances in Intelligent Systems and Computing - Advances in Intelligent Systems and Computing III; 10.1007/978-3-030-01069-0_10; 2018; pp. 132-151; Cited By ~ 1

Author(s): Victoria Vysotska, Vitor Basto Fernandes, Vasyl Lytvyn, Michael Emmerich, Mariya Hrendus

Keyword(s): Ukrainian Text, Text Content

Zrobię to bez zwłok… the category of number in the teaching of Polish as a foreign language to Ukrainian-speaking people

Acta Universitatis Lodziensis Kształcenie Polonistyczne Cudzoziemców; 10.18778/0860-6587.26.26; 2019; Vol 26; pp. 387-397

Author(s): Iryna Bundza

Keyword(s): Foreign Language, Grammatical Category, Text Corpus, Ukrainian Text, National Corpus

This article discusses the peculiarities of the category of number in Polish and Ukrainian nouns. To identify the problem areas in teaching the category of number to Ukrainian speakers, the author analysed Polish and Ukrainian lexemes in terms of how they realise the grammatical category of number. The article presents contexts that may trigger errors, which in turn may cause a comical effect or distort communication. The data were collected from Polish and Ukrainian dictionaries, as well as the National Corpus of Polish and the Ukrainian Text Corpus.

Study Comparison Stemmer to Optimize Text Preprocessing In Sentiment Analysis Indonesian E-Commerce Reviews

10.1109/icdabi53623.2021.9655867; 2021

Author(s): Yunita Fatma Faidha, Guruh Fajar Shidik, Ahmad Zainul Fanani

Keyword(s): Sentiment Analysis, Text Preprocessing

Research on Classroom Evaluation Algorithm Based on CNN Text Preprocessing

Advances in Natural Computation, Fuzzy Systems and Knowledge Discovery - Advances in Intelligent Systems and Computing; 10.1007/978-3-030-70665-4_168; 2021; pp. 1554-1561

Author(s): Yu Liu, Weidong Li, Chan Wang, Jie Zhao

Keyword(s): Evaluation Algorithm, Classroom Evaluation, Text Preprocessing

A Complete VADER-Based Sentiment Analysis of Bitcoin (BTC) Tweets during the Era of COVID-19

Big Data and Cognitive Computing; 10.3390/bdcc4040033; 2020; Vol 4 (4); pp. 33

Author(s): Toni Pano, Rasha Kashef

Keyword(s): Machine Learning, Social Media, Prediction Model, Sentiment Analysis, Significant Role, Prediction Models, Financial Sector, Research Gap, Text Preprocessing, The Impact

During the COVID-19 pandemic, many research studies have examined the impact of the outbreak on the financial sector, especially on cryptocurrencies. Social media, such as Twitter, plays a significant role as a meaningful indicator in forecasting Bitcoin (BTC) prices. However, there is a research gap in determining the optimal preprocessing strategy for BTC tweets to develop an accurate machine learning prediction model for Bitcoin prices. This paper develops different text preprocessing strategies for correlating the sentiment scores of Twitter text with Bitcoin prices during the COVID-19 pandemic. We explore the effect of different preprocessing functions, features, and time lengths of data on the correlation results. Out of 13 strategies, we discover that splitting sentences, removing Twitter-specific tags, or combining the two generally improves the correlation of sentiment scores and volume polarity scores with Bitcoin prices. The prices only correlate well with sentiment scores over shorter timespans. Selecting the optimum preprocessing strategy would help machine learning prediction models achieve better accuracy when compared with the actual prices.

Differentially-Private Text Generation via Text Preprocessing to Reduce Utility Loss

Evaluation of recent advances in recommender systems on Arabic content
