Text Emotion Distribution Learning via Multi-Task Convolutional Neural Network

Emotion analysis of online user-generated textual content is important for natural language processing and social media analytics tasks. Most previous emotion analysis approaches focus on identifying users’ emotional states from text by classifying emotions into one of a finite set of categories, e.g., joy, surprise, anger and fear. However, emotion analysis is inherently ambiguous, since a single sentence can evoke multiple emotions with different intensities. To address this problem, we introduce emotion distribution learning and propose a multi-task convolutional neural network for text emotion analysis. The end-to-end framework optimizes the distribution prediction and classification tasks simultaneously, which enables it to learn robust representations from distribution datasets annotated by different voters. While most work adopts the majority voting scheme for ground-truth labeling, we also propose a lexicon-based strategy to generate distributions from a single label, which provides prior information for emotion classification. Experiments conducted on five public text datasets (i.e., SemEval, Fairy Tales, ISEAR, TEC, CBET) demonstrate that our proposed method performs favorably against the state-of-the-art approaches.
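
The joint objective described in this abstract can be illustrated with a minimal sketch. It assumes the combined loss is a weighted sum of a cross-entropy term (classification of the dominant emotion) and a KL-divergence term (distribution prediction). The weight `lam` and the function names are hypothetical and are not taken from the paper.

```python
import math

def softmax(logits):
    """Convert raw scores into a probability distribution."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(pred, label_index):
    """Classification loss for a single dominant label."""
    return -math.log(pred[label_index])

def kl_divergence(target, pred):
    """KL(target || pred); terms with zero target mass contribute nothing."""
    return sum(t * math.log(t / p) for t, p in zip(target, pred) if t > 0)

def multi_task_loss(logits, target_dist, lam=0.5):
    """Weighted sum of both task losses; lam is an assumed trade-off weight."""
    pred = softmax(logits)
    # The dominant emotion is taken as the mode of the annotated distribution.
    dominant = max(range(len(target_dist)), key=target_dist.__getitem__)
    return ((1 - lam) * cross_entropy(pred, dominant)
            + lam * kl_divergence(target_dist, pred))
```

With `lam=0` the sketch reduces to ordinary single-label classification, and with `lam=1` it reduces to pure distribution learning.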

Joint Image Emotion Classification and Distribution Learning via Deep Convolutional Neural Network

Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2017/456 ◽

2017 ◽

Cited By ~ 26

Author(s):

Jufeng Yang ◽

Dongyu She ◽

Ming Sun

Keyword(s):

Neural Network ◽

Neural Networks ◽

Prior Knowledge ◽

State Of The Art ◽

Ground Truth ◽

Majority Voting ◽

Emotion Classification ◽

Voting Scheme ◽

Label Distribution ◽

Similarity Information

Visual sentiment analysis is attracting more and more attention with the increasing tendency to express emotions through visual content. Recent progress in convolutional neural networks (CNNs) has considerably advanced emotion classification, which aims to distinguish among emotional categories and assign a single dominant label to each image. However, the task is inherently ambiguous, since an image usually evokes multiple emotions and its annotation varies from person to person. In this work, we address the problem via label distribution learning (LDL) and develop a multi-task deep framework by jointly optimizing both classification and distribution prediction. While the proposed method is best suited to distribution datasets annotated by different voters, the majority voting scheme is widely adopted as the ground truth in this area, and few datasets provide multiple affective labels. Hence, we further exploit two weak forms of prior knowledge, expressed as similarity information between labels, to generate an emotional distribution for each category. The experiments conducted on both distribution datasets, i.e., Emotion6, Flickr_LDL, Twitter_LDL, and the largest single emotion dataset, i.e., Flickr and Instagram, demonstrate that the proposed method outperforms the state-of-the-art approaches.
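
One way to turn a single label into a distribution using similarity between labels can be sketched as follows. The sketch assumes that emotions are arranged on a circular wheel and that similarity decays as a Gaussian of the wheel distance. The function name, the circular layout and `sigma` are illustrative assumptions, not the exact prior knowledge used in the paper.

```python
import math

def distribution_from_label(label_index, num_labels, sigma=1.0):
    """Spread probability mass from one label to its neighbours on a wheel."""
    weights = []
    for j in range(num_labels):
        d = abs(j - label_index)
        d = min(d, num_labels - d)  # circular distance between the two labels
        weights.append(math.exp(-d * d / (2 * sigma * sigma)))
    total = sum(weights)
    return [w / total for w in weights]
```

A smaller `sigma` concentrates the mass on the annotated label, while a larger one yields a flatter distribution.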

Enhanced Weight Based Convolutional Neural Network (EWCNN) and Fuzzy Clustering For Semantically Rich Multi-Label Social Emotion Classification

International Journal of Innovative Technology and Exploring Engineering - Special Issue ◽

10.35940/ijitee.l3644.1081219 ◽

2019 ◽

Vol 8 (12) ◽

pp. 4112-4121

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Language Processing ◽

Fuzzy Clustering ◽

Large Scale ◽

Clustering Algorithm ◽

Novel Mutation ◽

Experimental Result ◽

Emotion Classification ◽

Social Emotion

In recent years, social emotion analysis has attracted the attention of natural language processing researchers because it analyzes user-generated emotional documents on the web. However, these documents contain noisy instances, and it is very difficult to capture the textual meaning of short messages. Definition: In general, large-scale datasets contain much noisy data, which cannot be used readily and is costly to process because of the ambiguity of various informal expressions in user-generated comments. It is very tedious to recognize similar user documents within the entire body of social media text messages. Furthermore, online comments are characterized by a sparse feature space, which makes the emotion classification task complex. Methodology: This work makes three major contributions to address these problems. The first is a novel Mutation Bat Optimization based Sparse Encoding (MBO-SC), which transforms sparse low-level features into dense high-level features. The second is an Enhanced Weight based Convolutional Neural Network (EWCNN) with a target-specific layer, which lets the classifier include semantic domain knowledge in the neural network to bootstrap its inference power and interpretability. The third is a fuzzy clustering algorithm, proposed to minimize the similarity between two documents. Uses: The approach is useful for recommending products, collecting public opinions, and predicting election results. The proposed work is compared with existing methods using precision, recall, sensitivity, specificity, F-measure and accuracy. The experimental results confirm that the proposed models can enhance the quality of the learned semantic vectors and the performance of social emotion classification.

Carpal Bone Segmentation Using Fully Convolutional Neural Network

Current Medical Imaging Formerly Current Medical Imaging Reviews ◽

10.2174/1573405615666190724101600 ◽

2019 ◽

Vol 15 (10) ◽

pp. 983-989

Author(s):

Liang Kim Meng ◽

Azira Khalil ◽

Muhamad Hanif Ahmad Nizar ◽

Maryam Kamarun Nisham ◽

Belinda Pingguan-Murphy ◽

...

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Automatic Segmentation ◽

Ground Truth ◽

Bone Age ◽

Image Feature ◽

Qualitative Assessment ◽

Carpal Bone ◽

Radiographic Images ◽

Accurate Quantitative Analysis

Background: Bone Age Assessment (BAA) refers to a clinical procedure that aims to identify a discrepancy between the biological and chronological age of an individual by assessing bone age growth. Currently, there are two main methods of executing BAA, known as the Greulich-Pyle and Tanner-Whitehouse techniques. Both techniques involve a manual and qualitative assessment of hand and wrist radiographs, which leads to intra- and inter-operator variability and is time-consuming. An automatic segmentation can be applied to the radiographs, providing the physician with a more accurate delineation of the carpal bone and accurate quantitative analysis. Methods: In this study, we proposed an image feature extraction technique based on image segmentation with the fully convolutional neural network with eight stride pixel (FCN-8). A total of 290 radiographic images of both female and male subjects aged 0 to 18 were manually segmented and used to train FCN-8. Results and Conclusion: The results exhibit a high training accuracy value of 99.68% and a loss rate of 0.008619 for 50 epochs of training. The experiments compared 58 images against the gold standard ground truth images. The accuracy of our fully automated segmentation technique is 0.78 ± 0.06, 1.56 ± 0.30 mm and 98.02% in terms of Dice Coefficient, Hausdorff Distance, and overall qualitative carpal recognition accuracy, respectively.
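
The Dice Coefficient reported above measures the overlap between a predicted mask and a ground-truth mask. A minimal sketch, assuming binary masks stored as nested lists (the function name and input layout are illustrative only):

```python
def dice_coefficient(pred_mask, true_mask):
    """Dice = 2 * |A ∩ B| / (|A| + |B|) for two binary masks of equal shape."""
    pred = [bool(v) for row in pred_mask for v in row]
    true = [bool(v) for row in true_mask for v in row]
    intersection = sum(p and t for p, t in zip(pred, true))
    size_sum = sum(pred) + sum(true)
    # Two empty masks are treated as a perfect match by convention.
    return 1.0 if size_sum == 0 else 2.0 * intersection / size_sum
```

A value of 1 means perfect overlap and 0 means no overlap at all.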

Geometric property-based convolutional neural network for indoor object detection

International Journal of Advanced Robotic Systems ◽

10.1177/1729881421993323 ◽

2021 ◽

Vol 18 (1) ◽

pp. 172988142199332

Author(s):

Xintao Ding ◽

Boquan Li ◽

Jinbao Wang

Keyword(s):

Neural Network ◽

Object Detection ◽

Convolutional Neural Network ◽

Geometric Property ◽

Ground Truth ◽

Geometric Constraints ◽

Depth Information ◽

Training Set ◽

Object Knowledge ◽

The Mean

Indoor object detection is a very demanding and important task for robot applications. Object knowledge, such as two-dimensional (2D) shape and depth information, may be helpful for detection. In this article, we focus on region-based convolutional neural network (CNN) detector and propose a geometric property-based Faster R-CNN method (GP-Faster) for indoor object detection. GP-Faster incorporates geometric property in Faster R-CNN to improve the detection performance. In detail, we first use mesh grids that are the intersections of direct and inverse proportion functions to generate appropriate anchors for indoor objects. After the anchors are regressed to the regions of interest produced by a region proposal network (RPN-RoIs), we then use 2D geometric constraints to refine the RPN-RoIs, in which the 2D constraint of every classification is a convex hull region enclosing the width and height coordinates of the ground-truth boxes on the training set. Comparison experiments are implemented on two indoor datasets SUN2012 and NYUv2. Since the depth information is available in NYUv2, we involve depth constraints in GP-Faster and propose 3D geometric property-based Faster R-CNN (DGP-Faster) on NYUv2. The experimental results show that both GP-Faster and DGP-Faster increase the performance of the mean average precision.
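
The 2D geometric constraint described above, a convex hull enclosing the (width, height) pairs of ground-truth boxes, can be sketched as follows. The sketch uses Andrew's monotone chain algorithm for the hull, which is a choice made here and is not stated in the abstract. The function names are hypothetical, and the training set is assumed to contain at least three non-collinear points.

```python
def cross(o, a, b):
    """Z-component of the cross product of vectors OA and OB."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

def convex_hull(points):
    """Return the hull vertices in counter-clockwise order (monotone chain)."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]

def inside_hull(hull, point):
    """True if the point lies inside or on the boundary of the hull."""
    n = len(hull)
    return all(cross(hull[i], hull[(i + 1) % n], point) >= 0 for i in range(n))
```

A proposal whose (width, height) pair falls outside the hull of its class could then be rejected or down-weighted. How GP-Faster uses the test in detail is not specified here.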

Research on Inversion Mechanism of Chlorophyll—A Concentration in Water Bodies Using a Convolutional Neural Network Model

Water ◽

10.3390/w13050664 ◽

2021 ◽

Vol 13 (5) ◽

pp. 664

Author(s):

Yun Xue ◽

Lei Zhu ◽

Bin Zou ◽

Yi-min Wen ◽

Yue-hong Long ◽

...

Keyword(s):

Neural Network ◽

Regression Model ◽

Convolutional Neural Network ◽

Chlorophyll A ◽

Language Processing ◽

Water Bodies ◽

Inversion Effect ◽

Least Squares Regression ◽

Chlorophyll A Concentration ◽

Chl A

For Case-II water bodies with relatively complex water qualities, it is challenging to establish a chlorophyll-a concentration (Chl-a concentration) inversion model with strong applicability and high accuracy. Convolutional Neural Network (CNN) shows excellent performance in image target recognition and natural language processing. However, little research exists on the inversion of Chl-a concentration in water using convolutional neural networks. Taking China’s Dongting Lake as an example, 90 water samples and their spectra were collected in this study. Using eight combinations as independent variables and Chl-a concentration as the dependent variable, a CNN model was constructed to invert Chl-a concentration. The results showed that: (1) the CNN model of the original spectrum has a worse inversion effect than the CNN model of the preprocessed spectrum. The determination coefficient (RP2) of the predicted sample is increased from 0.79 to 0.88, and the root mean square error (RMSEP) of the predicted sample is reduced from 0.61 to 0.49, indicating that preprocessing can significantly improve the inversion effect of the model; (2) among the combined models, the CNN model with Baseline1_SC (strong correlation factor of 500–750 nm baseline) has the best effect, with RP2 reaching 0.90 and RMSEP only 0.45. The average inversion effect of the eight CNN models is also good: the average RP2 reaches 0.86 and the RMSEP is only 0.52, indicating the feasibility of applying CNN to Chl-a concentration inversion modeling; (3) the performance of the CNN model (Baseline1_SC (RP2 = 0.90, RMSEP = 0.45)) was far better than the traditional models of the same combination, i.e., the linear regression model (RP2 = 0.61, RMSEP = 0.72) and the partial least squares regression model (Baseline1_SC (RP2 = 0.58, RMSEP = 0.95)), indicating the superiority of the convolutional neural network inversion modeling of water body Chl-a concentration.
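
The two metrics quoted in this abstract can be computed as in the sketch below. It assumes the usual definitions: the coefficient of determination as one minus the ratio of residual to total sum of squares, and RMSE as the square root of the mean squared error. Whether the paper computes RP2 in exactly this way is an assumption.

```python
import math

def rmse(y_true, y_pred):
    """Root mean square error between measured and predicted values."""
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))

def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot
```

Higher R² and lower RMSE on the prediction set both indicate a better inversion model, which is how the figures in the abstract are compared.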

Unraveling the deep learning gearbox in optical coherence tomography image segmentation towards explainable artificial intelligence

Communications Biology ◽

10.1038/s42003-021-01697-y ◽

2021 ◽

Vol 4 (1) ◽

Author(s):

Peter M. Maloca ◽

Philipp L. Müller ◽

Aaron Y. Lee ◽

Adnan Tufail ◽

Konstantinos Balaskas ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Optical Coherence Tomography ◽

Image Segmentation ◽

Convolutional Neural Network ◽

Learning Algorithm ◽

Ground Truth ◽

Optical Coherence Tomography Image ◽

Optical Coherence ◽

Tomography Image

Machine learning has greatly facilitated the analysis of medical data, while its internal operations usually remain opaque. To better comprehend these opaque procedures, a convolutional neural network for optical coherence tomography image segmentation was enhanced with a Traceable Relevance Explainability (T-REX) technique. The proposed application was based on three components: ground truth generation by multiple graders, calculation of Hamming distances among graders and the machine learning algorithm, as well as a smart data visualization (‘neural recording’). An overall average variability of 1.75% between the human graders and the algorithm was found, slightly lower than the 2.02% among human graders. The ambiguity in ground truth had a noteworthy impact on machine learning results, which could be visualized. The convolutional neural network balanced between graders and allowed for modifiable predictions dependent on the compartment. Using the proposed T-REX setup, machine learning processes could be rendered more transparent and understandable, possibly leading to optimized applications.
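
The Hamming-distance comparison among graders can be sketched as follows. The sketch assumes each segmentation is a flat sequence of per-pixel labels and that variability is reported as the fraction of differing pixels averaged over all grader pairs. The function names and this normalization are assumptions, not details taken from the paper.

```python
from itertools import combinations

def hamming_fraction(seg_a, seg_b):
    """Fraction of pixels whose labels differ between two segmentations."""
    if len(seg_a) != len(seg_b):
        raise ValueError("segmentations must have the same number of pixels")
    differing = sum(a != b for a, b in zip(seg_a, seg_b))
    return differing / len(seg_a)

def mean_pairwise_variability(segmentations):
    """Average Hamming fraction over every pair of graders."""
    pairs = list(combinations(segmentations, 2))
    return sum(hamming_fraction(a, b) for a, b in pairs) / len(pairs)
```

Adding the algorithm's output as one more entry in `segmentations` would allow a human-versus-machine comparison on the same scale.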

Towards Accurate Deceptive Opinions Detection Based on Word Order-Preserving CNN

Mathematical Problems in Engineering ◽

10.1155/2018/2410206 ◽

2018 ◽

Vol 2018 ◽

pp. 1-9 ◽

Cited By ~ 4

Author(s):

Siyuan Zhao ◽

Zhiwei Xu ◽

Limin Liu ◽

Mengjie Guo ◽

Jing Yun

Keyword(s):

Neural Network ◽

Natural Language Processing ◽

Natural Language ◽

Convolutional Neural Network ◽

Language Processing ◽

Word Order ◽

Text Analysis ◽

Important Application ◽

Detection Mechanism ◽

Short Text

Convolutional neural networks (CNNs) have revolutionized the field of natural language processing and are considerably efficient at the semantic analysis that underlies difficult natural language processing problems in a variety of domains. Deceptive opinion detection is an important application of existing CNN models. A detection mechanism based on CNN models has better self-adaptability and can effectively identify all kinds of deceptive opinions. Online opinions are quite short and vary in type and content. In order to effectively identify deceptive opinions, we need to comprehensively study their characteristics and explore novel characteristics besides the textual semantics and emotional polarity that have been widely used in text analysis. In this paper, we optimize the convolutional neural network model by embedding word order characteristics in its convolution layer and pooling layer, which makes the convolutional neural network more suitable for short text classification and deceptive opinion detection. The TensorFlow-based experiments demonstrate that the proposed detection mechanism achieves more accurate deceptive opinion detection results.

Emergence of a compositional neural code for written words: Recycling of a convolutional neural network for reading

Proceedings of the National Academy of Sciences ◽

10.1073/pnas.2104779118 ◽

2021 ◽

Vol 118 (46) ◽

pp. e2104779118

Author(s):

T. Hannagan ◽

A. Agrawal ◽

L. Cohen ◽

S. Dehaene

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Language Processing ◽

Visual Pathway ◽

Neural Code ◽

Letter Recognition ◽

Invariant Representation ◽

Spoken Language Processing ◽

Visual Word Form Area ◽

Ventral Visual Pathway

The visual word form area (VWFA) is a region of human inferotemporal cortex that emerges at a fixed location in the occipitotemporal cortex during reading acquisition and systematically responds to written words in literate individuals. According to the neuronal recycling hypothesis, this region arises through the repurposing, for letter recognition, of a subpart of the ventral visual pathway initially involved in face and object recognition. Furthermore, according to the biased connectivity hypothesis, its reproducible localization is due to preexisting connections from this subregion to areas involved in spoken-language processing. Here, we evaluate those hypotheses in an explicit computational model. We trained a deep convolutional neural network of the ventral visual pathway, first to categorize pictures and then to recognize written words invariantly for case, font, and size. We show that the model can account for many properties of the VWFA, particularly when a subset of units possesses a biased connectivity to word output units. The network develops a sparse, invariant representation of written words, based on a restricted set of reading-selective units. Their activation mimics several properties of the VWFA, and their lesioning causes a reading-specific deficit. The model predicts that, in literate brains, written words are encoded by a compositional neural code with neurons tuned either to individual letters and their ordinal position relative to word start or word ending or to pairs of letters (bigrams).

Emotion Classification on Indonesian Twitter Using Convolutional Neural Network (CNN)

10.1109/icoict52021.2021.9527447 ◽

2021 ◽

Author(s):

Firhan Maulana Rusli ◽

Rita Rismala ◽

Hani Nurrahmi

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Emotion Classification

An Improved Cross-Domain Sentiment Analysis Based on a Semi-Supervised Convolutional Neural Network

10.4018/978-1-7998-8413-2.ch007 ◽

2022 ◽

pp. 155-170

Author(s):

Lap-Kei Lee ◽

Kwok Tai Chui ◽

Jingjing Wang ◽

Yin-Chun Fung ◽

Zhanhui Tan

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Sentiment Analysis ◽

Language Processing ◽

Research Work ◽

Latent Semantic Indexing ◽

Future Research ◽

Training Time ◽

Cross Domain ◽

Document Frequency

The dependence on the Internet in our daily life is ever-growing, which provides an opportunity to discover valuable and subjective information using advanced techniques such as natural language processing and artificial intelligence. In this chapter, the research focus is a convolutional neural network for three-class (positive, neutral, and negative) cross-domain sentiment analysis. The model is enhanced in two ways. First, a similarity label method facilitates the management between the source and target domains to generate more labelled data. Second, term frequency-inverse document frequency (TF-IDF) and latent semantic indexing (LSI) are employed to compute the similarity between source and target domains. Performance evaluation is conducted using three datasets: beauty reviews, toys reviews, and phone reviews. The proposed method enhances the accuracy by 4.3-7.6% and reduces the training time by 50%. The limitations of the research work have been discussed, which serve as the rationales of future research directions.
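
The TF-IDF part of the similarity computation can be sketched as below. The sketch assumes whitespace tokenization, a smoothed IDF term and cosine similarity between the resulting vectors. It omits the LSI step, and the function names are hypothetical rather than taken from the chapter.

```python
import math
from collections import Counter

def tfidf_vectors(documents):
    """Build one sparse TF-IDF vector (a dict) per document."""
    tokenized = [doc.lower().split() for doc in documents]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term occurs.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            term: (count / len(tokens))
            * (math.log((1 + n_docs) / (1 + doc_freq[term])) + 1)
            for term, count in counts.items()
        })
    return vectors

def cosine_similarity(u, v):
    """Cosine of the angle between two sparse vectors."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)
```

Reviews from a target domain that score high against source-domain reviews would be natural candidates for receiving transferred labels.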
