A Technical Evaluation of Neo4j and Elasticsearch for Mining Twitter Data

Author(s):  
Janet Zhu ◽  
Sreenivas Sremath Tirumala ◽  
G. Anjan Babu
2020 ◽  
Vol 14 (2) ◽  
pp. 140-159
Author(s):  
Anthony-Paul Cooper ◽  
Emmanuel Awuni Kolog ◽  
Erkki Sutinen

This article builds on previous research around the exploration of the content of church-related tweets. It does so by exploring whether the qualitative thematic coding of such tweets can, in part, be automated by the use of machine learning. It compares three supervised machine learning algorithms to understand how useful each algorithm is at a classification task, based on a dataset of human-coded church-related tweets. The study finds that one such algorithm, Naïve-Bayes, performs better than the other algorithms considered, returning Precision, Recall and F-measure values which each exceed an acceptable threshold of 70%. This has far-reaching consequences at a time where the high volume of social media data, in this case, Twitter data, means that the resource-intensity of manual coding approaches can act as a barrier to understanding how the online community interacts with, and talks about, church. The findings presented in this article offer a way forward for scholars of digital theology to better understand the content of online church discourse.


Author(s):  
Samarth Jaykar Shetty ◽  
◽  
Badal Rakesh Thosani ◽  
Lenherd Deon Olivera ◽  
Supriya Kamoji ◽  
...  
Keyword(s):  

2015 ◽  
Vol 10 (2) ◽  
pp. 66
Author(s):  
Junaidi - ◽  
Ichlas Nur ◽  
Nofriadi - ◽  
Rusmardi -

Waste plastic mounting, but can be recycled into other products in the form of granules before further processed into pellets and seed injection molding process produces products such as buckets, plates, bottles and other beverages. To be processed into the required form of granules of plastic thrasher. Though so small plastic recycling industry is still constrained in plastic enumeration process because the machine used was not optimal ability. The purpose of this research is the development of the system thrasher plastic crusher and cutter cylinder-type reel and technical evaluation. This study was conducted over two years, the first year the design and manufacture of machinery, the second year is a technical evaluation of the engine, engine performance improvements and economic analysis of granular plastic products.From the results obtained engine design capacity of the machine ± 350 kg / h, the engine size is 50 cm x 120 cm x 30 cm, power motor of 10 HP at 1450 RPM rotation with 3 phase. Some of the major components of the engine that is, counter crusher unit consists of two counter rotating cylinders opposite, counter shaft size Ø 4 cm x 58 cm, blade chopper Ø 17 cm x 2 cm with the number of teeth / blades 7 pieces and the number of blades along shaft 7 pieces, buses retaining Ø 10 cm x 2 cm. Counter-cylinder unit consists of a reel-type cutter counter shaft size Ø 4 cm x 90 cm, the middle shaft mounted cylinder with Ø 17 cm x 40 cm as the holder of the chopper blades. Chopper blade consists of 4 pieces with a size of 40 cm x 2 cm x 4 cm with ASSAB materials. Furthermore, as the blade retaining bedknife shear force of the blade chopper, upper frame, lower frame, strainer, funnel entry, exit funnel, and the drive unit consists of an electric motor, reducer, belts, pulleys and 2 pieces of gear transmission. The results of performance testing machine crusher round cylinder 75 RPM and 1450 RPM reel-type cutting machine capacity ± 300 kg / h on the filter hole Ø 1.5 cm, with a 80% grain uniformity.


2003 ◽  
Author(s):  
R. M. Kwan ◽  
A. J. Yost ◽  
J. V. Santiago ◽  
A. C. Herring ◽  
M. M. Conlin
Keyword(s):  

2019 ◽  
Author(s):  
Joseph Tassone ◽  
Peizhi Yan ◽  
Mackenzie Simpson ◽  
Chetan Mendhe ◽  
Vijay Mago ◽  
...  

BACKGROUND The collection and examination of social media has become a useful mechanism for studying the mental activity and behavior tendencies of users. OBJECTIVE Through the analysis of a collected set of Twitter data, a model will be developed for predicting positively referenced, drug-related tweets. From this, trends and correlations can be determined. METHODS Twitter social media tweets and attribute data were collected and processed using topic pertaining keywords, such as drug slang and use-conditions (methods of drug consumption). Potential candidates were preprocessed resulting in a dataset 3,696,150 rows. The predictive classification power of multiple methods was compared including regression, decision trees, and CNN-based classifiers. For the latter, a deep learning approach was implemented to screen and analyze the semantic meaning of the tweets. RESULTS The logistic regression and decision tree models utilized 12,142 data points for training and 1041 data points for testing. The results calculated from the logistic regression models respectively displayed an accuracy of 54.56% and 57.44%, and an AUC of 0.58. While an improvement, the decision tree concluded with an accuracy of 63.40% and an AUC of 0.68. All these values implied a low predictive capability with little to no discrimination. Conversely, the CNN-based classifiers presented a heavy improvement, between the two models tested. The first was trained with 2,661 manually labeled samples, while the other included synthetically generated tweets culminating in 12,142 samples. The accuracy scores were 76.35% and 82.31%, with an AUC of 0.90 and 0.91. Using association rule mining in conjunction with the CNN-based classifier showed a high likelihood for keywords such as “smoke”, “cocaine”, and “marijuana” triggering a drug-positive classification. CONCLUSIONS Predictive analysis without a CNN is limited and possibly fruitless. Attribute-based models presented little predictive capability and were not suitable for analyzing this type of data. The semantic meaning of the tweets needed to be utilized, giving the CNN-based classifier an advantage over other solutions. Additionally, commonly mentioned drugs had a level of correspondence with frequently used illicit substances, proving the practical usefulness of this system. Lastly, the synthetically generated set provided increased scores, improving the predictive capability. CLINICALTRIAL None


Sign in / Sign up

Export Citation Format

Share Document