Textual data dimensionality reduction - a deep learning approach

In recent years, increasing attention is being paid to sentiment analysis on microblogging platforms such as Twitter. Sentiment analysis refers to the task of detecting whether a textual item (e.g., a tweet) contains an opinion about a topic. This paper proposes a probabilistic deep learning approach for sentiments analysis. The deep learning model used is a convolutional neural network (CNN). The main contribution of this approach is a new probabilistic representation of the text to be fed as input to the CNN. This representation is a matrix that stores for each word composing the message the probability that it belongs to a positive class and the probability that it belongs to a negative class. The proposed approach is evaluated on four well-known datasets HCR, OMD, STS-gold, and a dataset provided by the SemEval-2017 Workshop. The results of the experiments show that the proposed approach competes with the state-of-the-art sentiment analyzers and has the potential to detect sentiments from textual data in an effective manner.

Download Full-text

Deep learning approach based on dimensionality reduction for designing electromagnetic nanostructures

npj Computational Materials ◽

10.1038/s41524-020-0276-y ◽

2020 ◽

Vol 6 (1) ◽

Cited By ~ 30

Author(s):

Yashar Kiarashinejad ◽

Sajjad Abdollahramezani ◽

Ali Adibi

Keyword(s):

Deep Learning ◽

Dimensionality Reduction ◽

Learning Approach

Download Full-text

Microblog Dimensionality Reduction—A Deep Learning Approach

IEEE Transactions on Knowledge and Data Engineering ◽

10.1109/tkde.2016.2540639 ◽

2016 ◽

Vol 28 (7) ◽

pp. 1779-1789 ◽

Cited By ~ 12

Author(s):

Lei Xu ◽

Chunxiao Jiang ◽

Yong Ren ◽

Hsiao-Hwa Chen

Keyword(s):

Deep Learning ◽

Dimensionality Reduction ◽

Learning Approach

Download Full-text

Combining time-series and textual data for taxi demand prediction in event areas: A deep learning approach

Information Fusion ◽

10.1016/j.inffus.2018.07.007 ◽

2019 ◽

Vol 49 ◽

pp. 120-129 ◽

Cited By ~ 26

Author(s):

Filipe Rodrigues ◽

Ioulia Markou ◽

Francisco C. Pereira

Keyword(s):

Time Series ◽

Deep Learning ◽

Learning Approach ◽

Demand Prediction ◽

Textual Data

Download Full-text

Image or Text: Which One is More Influential? A Deep Learning Approach for Visual and Textual Data Analysis in Digital Economy

Communications of the Association for Information Systems ◽

10.17705/1cais.04708 ◽

2020 ◽

Vol 47 (1) ◽

pp. 165-188

Author(s):

Ying Wang ◽

◽

Jaeki Song ◽

Keyword(s):

Deep Learning ◽

Data Analysis ◽

Digital Economy ◽

Learning Approach ◽

Textual Data

Download Full-text

Traditional Dimensionality Reduction Techniques Using Deep Learning

International Journal of Recent Technology and Engineering - 2 ◽

10.35940/ijrte.c6110.098319 ◽

2019 ◽

Vol 8 (3) ◽

pp. 7153-7160

Keyword(s):

Deep Learning ◽

Dimensionality Reduction ◽

Principal Component ◽

Random Projection ◽

High Dimensions ◽

Quality Performance ◽

Reduction Techniques ◽

Data Dimensionality Reduction ◽

Dimensionality Reduction Techniques ◽

Non Negative Matrix Factorization

From the analysis of big data, dimensionality reduction techniques play a significant role in various fields where the data is huge with multiple columns or classes. Data with high dimensions contains thousands of features where many of these features contain useful information. Along with this there contains a lot of redundant or irrelevant features which reduce the quality, performance of data and decrease the efficiency in computation. Procedures which are done mathematically for reducing dimensions are known as dimensionality reduction techniques. The main aim of the Dimensionality Reduction algorithms such as Principal Component Analysis (PCA), Random Projection (RP) and Non Negative Matrix Factorization (NMF) is used to decrease the inappropriate information from the data and moreover the features and attributes taken from these algorithms were not able to characterize data as different divisions. This paper gives a review about the traditional methods used in Machine algorithm for reducing the dimension and proposes a view, how deep learning can be used for dimensionality reduction.

Download Full-text

Dimensionality Reduction for Network Anomalies Detection: A Deep Learning Approach

Advances in Intelligent Systems and Computing - Web, Artificial Intelligence and Network Applications ◽

10.1007/978-3-030-15035-8_94 ◽

2019 ◽

pp. 957-965

Author(s):

Ahmed Dawoud ◽

Seyed Shahristani ◽

Chun Raun

Keyword(s):

Deep Learning ◽

Dimensionality Reduction ◽

Learning Approach ◽

Network Anomalies

Download Full-text

Comparison of various Activation Functions A Deep Learning Approach

International Journal of Computer Sciences and Engineering ◽

10.26438/ijcse/v6i3.122126 ◽

2018 ◽

Vol 6 (3) ◽

pp. 122-126

Author(s):

Mohammed Ibrahim Khan ◽

◽

Akansha Singh ◽

Anand Handa ◽

◽

...

Keyword(s):

Deep Learning ◽

Learning Approach ◽

Activation Functions

Download Full-text

A Deep Learning based Arabic Script Recognition System: Benchmark on KHAT

The International Arab Journal of Information Technology ◽

10.34028/iajit/17/3/3 ◽

2020 ◽

Vol 17 (3) ◽

pp. 299-305 ◽

Cited By ~ 1

Author(s):

Riaz Ahmad ◽

Saeeda Naz ◽

Muhammad Afzal ◽

Sheikh Rashid ◽

Marcus Liwicki ◽

...

Keyword(s):

Deep Learning ◽

Character Recognition ◽

Data Augmentation ◽

Short Term Memory ◽

Recognition System ◽

Learning Approach ◽

Arabic Text ◽

Data Set ◽

Processing Step ◽

Handwritten Arabic

This paper presents a deep learning benchmark on a complex dataset known as KFUPM Handwritten Arabic TexT (KHATT). The KHATT data-set consists of complex patterns of handwritten Arabic text-lines. This paper contributes mainly in three aspects i.e., (1) pre-processing, (2) deep learning based approach, and (3) data-augmentation. The pre-processing step includes pruning of white extra spaces plus de-skewing the skewed text-lines. We deploy a deep learning approach based on Multi-Dimensional Long Short-Term Memory (MDLSTM) networks and Connectionist Temporal Classification (CTC). The MDLSTM has the advantage of scanning the Arabic text-lines in all directions (horizontal and vertical) to cover dots, diacritics, strokes and fine inflammation. The data-augmentation with a deep learning approach proves to achieve better and promising improvement in results by gaining 80.02% Character Recognition (CR) over 75.08% as baseline.

Download Full-text