Deep recurrent neural network for geographical entities disambiguation on social media data

2019 ◽  
Vol 173 ◽  
pp. 117-127 ◽  
Author(s):  
Cristina Zuheros ◽  
Siham Tabik ◽  
Ana Valdivia ◽  
Eugenio Martínez-Cámara ◽  
Francisco Herrera
10.2196/13076 ◽  
2019 ◽  
Vol 6 (12) ◽  
pp. e13076
Author(s):  
Sara Melvin ◽  
Amanda Jamal ◽  
Kaitlyn Hill ◽  
Wei Wang ◽  
Sean D Young

Background Social media data can be explored as a tool to detect sleep deprivation. First-year undergraduate students in their first quarter were invited to wear sleep-tracking devices (Basis; Intel), allow us to follow them on Twitter, and complete weekly surveys regarding their sleep. Objective This study aimed to determine whether social media data can be used to monitor sleep deprivation. Methods The sleep data obtained from the device were utilized to create a tiredness model that aided in labeling the tweets as sleep deprived or not at the time of posting. Labeled data were used to train and test a gated recurrent unit (GRU) neural network as to whether or not study participants were sleep deprived at the time of posting. Results Results from the GRU neural network suggest that it is possible to classify the sleep-deprivation status of a tweet’s author with an average area under the curve of 0.68. Conclusions It is feasible to use social media to identify students’ sleep deprivation. The results add to the body of research suggesting that social media data should be further explored as a potential source for monitoring health.


2016 ◽  
Vol 2016 ◽  
pp. 1-13 ◽  
Author(s):  
Helena Gómez-Adorno ◽  
Ilia Markov ◽  
Grigori Sidorov ◽  
Juan-Pablo Posadas-Durán ◽  
Miguel A. Sanchez-Perez ◽  
...  

We introduce a lexical resource for preprocessing social media data. We show that a neural network-based feature representation is enhanced by using this resource. We conducted experiments on the PAN 2015 and PAN 2016 author profiling corpora and obtained better results when performing the data preprocessing using the developed lexical resource. The resource includes dictionaries of slang words, contractions, abbreviations, and emoticons commonly used in social media. Each of the dictionaries was built for the English, Spanish, Dutch, and Italian languages. The resource is freely available.


Symmetry ◽  
2019 ◽  
Vol 11 (11) ◽  
pp. 1408 ◽  
Author(s):  
Yichun Xu ◽  
Chen Wang ◽  
Zhiping Dan ◽  
Shuifa Sun ◽  
Fangmin Dong

Social media makes it easy for individuals to publish and consume news, but it also facilitates the spread of rumors. This paper proposes a novel deep recurrent neural model with a symmetrical network architecture for automatic rumor detection in social media such as Sina Weibo, which shows better performance than the existing methods. In the data preparing phase, we filter the posts according to the followers of the user. We then use sequential encoding for the posts and multiple embedding layers to get better feature representation, and multiple recurrent neural network layers to capture the dynamic temporal signals characteristic. The experimental results on the Sina Weibo dataset show that: 1. the sequential encoding performs better than the term frequency-inverse document frequency (TF-IDF) or the doc2vec encoding scheme; 2. the model is more accurate when trained on the posts from the users with more followers; and 3. the model achieves superior improvements over the existing works on the accuracy of detection, including the early detection.


2018 ◽  
Author(s):  
Sara Melvin ◽  
Amanda Jamal ◽  
Kaitlyn Hill ◽  
Wei Wang ◽  
Sean D Young

BACKGROUND Social media data can be explored as a tool to detect sleep deprivation. First-year undergraduate students in their first quarter were invited to wear sleep-tracking devices (Basis; Intel), allow us to follow them on Twitter, and complete weekly surveys regarding their sleep. OBJECTIVE This study aimed to determine whether social media data can be used to monitor sleep deprivation. METHODS The sleep data obtained from the device were utilized to create a tiredness model that aided in labeling the tweets as sleep deprived or not at the time of posting. Labeled data were used to train and test a gated recurrent unit (GRU) neural network as to whether or not study participants were sleep deprived at the time of posting. RESULTS Results from the GRU neural network suggest that it is possible to classify the sleep-deprivation status of a tweet’s author with an average area under the curve of 0.68. CONCLUSIONS It is feasible to use social media to identify students’ sleep deprivation. The results add to the body of research suggesting that social media data should be further explored as a potential source for monitoring health.


2021 ◽  
Vol 2021 ◽  
pp. 1-13
Author(s):  
Fazeel Abid ◽  
Ikram Ud Din ◽  
Ahmad Almogren ◽  
Hasan Ali Khattak ◽  
Mirza Waqar Baig

Deep learning-based methodologies are significant to perform sentiment analysis on social media data. The valuable insights of social media data through sentiment analysis can be employed to develop intelligent applications. Among many networks, convolution neural networks (CNNs) are widely used in many conventional text classification tasks and perform a significant role. However, to capture long-term contextual information and address the detail loss problem, CNNs require stacking multiple convolutional layers. Also, the stacking of convolutional layers has issues requiring massive computations and the tuning of additional parameters. To solve these problems, in this paper, a contextualized concatenated word representation (CCWRs) is initialized from social media data based on text which is essential to misspelled and out of vocabulary words (OOV). In CCWRs, different word representation models, for example, Word2Vec, its optimized version FastText and Global Vectors, and GloVe, collectively create contextualized representations upon the sequence of input. Second, a three-layered dilated convolutional neural network (3D-CNN) is proposed that places dilated convolution kernels instead of conventional CNN kernels. Incorporating the extension in the receptive field’s size successfully solves the detail loss problem and achieves long-term context information with different dilation rates. Experiments on datasets demonstrate that the proposed framework achieves reliable results with the selection of numerous hyperparameter tuning and configurations for improved optimization leads to reduced computational resources and reliable accuracy.


Sign in / Sign up

Export Citation Format

Share Document